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A.  PARSERS 


Introduction 

A special  influence  on  the  design  of  every  parser  used  in  HWIM  has 

been  the  unique  role  assigned  to  its  Syntactic  Component  (Vol.  Ill, 

Sec.  E).  HtolM's  control  strategy  requires  that  the  Syntactic  component  be 
able  to  take  any  1 sland  (i.e.,  consecutive  sequence  of  word  matches)  and 
determine  if  it  can  be  parsed  as  an  acceptable  subsequence  within  the 
grammar.  If  so,  the  Syntactic  component  must  be  able  to  return  to  Control 
a list  of  words  and  categories  that  would  form  acceptable  extensions  from 
the  island  at  either  end.  The  constraints  this  places  on  the  parser  are 
tnat,  it  must  be  able  to  start  at  any  point  in  the  input  (not  just  the  left 
end)  and  at  any  point  in  the  grammar  and  work  in  either  direction.  It  must 
be  able  to  process  islands  that  may  partially  or  fully  traverse  several 
different  levels  of  the  grammar.  Predictions  of  adjacent  words  should  be 
based  on  the  entire  context  of  the  island,  rather  than  on,  fo:  example, 
only  the  words  at  either  end.  The  latter  constraint  requires  that  the 

parser,  at  least  for  making  predictions,  must  have  a way  to  access  the 

possible  paths  through  the  grammar  within  a given  island. 

1 . Island  Parsing 

The  "island  parsing"  mechanism  to  be  described  here  supports  the 
island  driven  strategies  discussed  in  Vol.  Ill,  Sec.  E.  An  island  is  a 
single  word  or  contiguous  string  of  words  that  covers  some  part  of  an 
utterance.  The  parser  treats  each  island  as  an  entity  that  is  created 
either  as  a single  word  (i.e.,  a seed  event ) or  by  adding  a single  word  to 
an  existing  island  (i.e.,  as  a word  event ) . This  allows  Control  to  select 
the  best  available  seed  or  word  event  for  syntactic  processing  at  each 
step . 

The  parser  has  four  basic  actions.  The  first,  seed  event  processing . 
creates  a new  one-word  island  and  a representation  for  every  path  through 
that  word  that  the  grammar  allows.  The  second,  word  eve  ,t  processing,  adds 
a new  word  to  one  end  of  an  existing  island,  extending  those  paths  that 
allow  the  new  word  and  eliminating  those  paths  that  do  not.  By  this 
mechanism,  the  addition  of  a word  at  one  end  of  an  island  can  narrow  the 
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proposals  at  the  other  end.  The  third  action  of  the  parser  is  the  end 
event  proce. ssing . This  takes  place  when  an  existing  island  has  reached  one 
end  of  the  utterance.  The  parser  then  extends  the  paths  that  can  reach  the 
start  state  or  a final  state  of  the  grammar  without  using  additional  words. 
If  the  other  end  of  the  island  is  open,  the  parser  returns  a set  of 
predictions  at  that  end,  which  may  be  more  restricted  than  before;  if  that 
end  is  also  complete,  the  parser  returns  a complete  parse.  The  last  type 
of  action,  Island  colli slor,  event  proce ssing . was  added  as  an  efficiency 
mechanism  to  allow  Control  to  merge  two  islands  with  a one-word  gap  between 
them  and  a common  proposal  for  that  gap.  Island  collision  events  could 
easily  be  simulated  by  several  word  events,  but  a substantial  savings  in 
Control  and  Control-Syntax  interface  effort  is  achieved  by  use  of  this  type 
of  event.  All  four  types  of  processing  are  discussed  below. 

2,  Grammar  Structures 

a)  Description  cf  the  ATN  Machine 

In  order  to  better  characterize  the  new  parsing  system  used  in  HWIM, 

it  is  useful  to  think  of  its  grammar  as  an  "ATN  machine",  i.e.,  to  see  it 

proceduraliy  rather  than  structurally.  This  will  allow  us  to  give  a more 
concise  description  of  the  parsing  algorithm  and  show  that  it  will  produce 
the  same  results  as  ATN  parsers  described  elsewhere  [Bates,  1975b;  Burton, 
1976;  Burton  and  Woods,  1976;  Woods,  1969,  1970]. 

The  cui rent  HWIM  grammars  (see  Sec.  B1  use  only  five  different  arc 
types:  PUSH,  POP,  WRD,  CAT  and  JUMP. 

A PISH  arc  essentially  "consumes"  a non-terminal  in  the  grammar  by 
causing  a new  transition  network  to  be  entered  at  a lower  level.  If  the 
lower  transition  r ‘'work  can  successfully  accept  the  next  segment  of  the 
input  string,  it  is  exited  by  a POP  arc,  and  processing  will  continue  in 
the  transition  network  containing  the  PUSH  arc.  Each  final  state  of  a 

transition  network  will  contain  a POP  arc  that  signifies  successful 

completion  of  the  network  and  indicates  what  structure  is  to  be  returned 
for  use  by  ics  calling  network. 
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wRD  and  CAT  arcs  are  the  on 1 y terminal  consuming  (i.e.,  word 
consuming)  arcs  in  the  grammar.  That  is,  they  are  the  only  arcs  that 

advance  the  "input  pointer"  that  marks  the  current  position  in  the  input 
string.  (This  pointer  may  move  during  a PUSH  arc,  but  only  as  a result  of 
WHD  and  CAT  arcs  taken  in  the  lower  network.)  WRD  and  CAT  arcs  differ  only 
in  the  way  they  express  the  3et  of  terminals  they  can  consume.  A WHD  arc 
specifies  its  terminals  explicitly  and  exhaustively,  while  a CAT  arc 
specifies  them  imp'icitly  via  the  syntactic-semantic  category  to  which  they 
must  belong.  Since  uhe  function  of  WRD  and  CAT  arcs  in  the  ATN  system  is 
so  similar,  they  will  be  considered  the  same  for  most  of  the  functional 
description  of  the  HWIM  parser. 

A JUMP  arc  is  similar  to  a WRD  or  CAT  arc  since  it  continues  the  given 
transition  nr. work.  However,  it  does  not  consume  a terminal  in  doing  30 
and  hence  does  not  advance  the  input  pointer. 

<ATN.>  ->  (< state > <state>*) 

<state>  ->  ( <state-nameXarcXarc>* ) 

<arc>  ->  <W>  | <H>  ! <C>  j <J> 

<W>  ->  (CAT  !category-name>  <'action>*  (TO  < state -name >) ) 

(WRD  <termlnai  "word">  <action>*  (TO  <state-name>) ) 

<B>  ->  (PUSH  <state-name>  <action>*  (TO  < stat.e-name>) ) 

<C>  ->  (POP  <form>  <action>*) 

<J>  ->  (JUMP  <state-name>  <action>*) 

<action>  ->  <action1>  ! 

(SCOPE  <scope-spec>  <action1>  <actionl>*) 

<act ion  1 > ->  (VERIFY  <action1>)  ! 

<regi ster-setting-action>  I 
< structure-building-action>  ! 

<testing-action> 

<sccpe-spec>  ->  ( < state-name>  <state-name>* ) ! 

NIL  ! T 

Fig.  1.  Format  r>f  a HWJ.M  ATN  grammar 

Figure  1 shows  the  notation  we  use  for  describing  a HWIM  ATN  grammar; 
it  is  similar  to  most  other  ATN  formalisms.  In  referring  to  an  arc,  the 
arc  type  is  the  first  entry  (WRD,  CAT,  etc.)  and  the  arc  label  is  the 
second  element  of  the  arc.  There  will  be  no  discussion  of  the  operation  of 
a formal  ATN  machine  here,  since  that  has  been  described  fully  elsewhere. 
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The  purpose  of  this  formal  description  is  to  allow  us  to  show  the  new  and 
different  features  used  by  the  HWIM  parser. 

b ) State  Relation s used  in  HWIM  Parser 

In  this  section  we  will  refer  to  the  connection  of  two  states  by  a 
particular  type  of  arc  as  a relation  of  that  name  between  those  two  states. 
We  will  use  not  the  ATN  arc  types  (PUSH,  JUMP  etc.)  but,  in  most  cases,  the 
single  letter  given  in  Fig.  1 above.  Here,  W represents  a terminal 
consuming  arc  (i.e.,  a WRD  or  CAT  arc);  J,  a JUMP  arc;  and  P,  a PUSH  arc. 
In  addition,  we  will  use  D to  represent  the  relationship  between  the  state 
at  the  beginning  of  a PUSH  arc  and  the  state  pushed  to,  and  we  will  use  C 
for  the  relationship  between  the  state  at  the  beginning  of  a POP  are  and  a 
state  to  which  control  could  revert  at  a higher  level  after  executing  the 
POP  (i.e.,  a state  at  the  end  of  a PUSH  arc  that  could  push  to  the  network 
containing  the  POP).  Thus  B and  C represent  beginning  and  completing  a 
lower  level  of  the  grammar,  while  W,  P and  J represent  transitions  within 
the  same  level.  For  example,  if  there  exists  a jump  rare  between  STATE!  and 
STATE2  we  would  say  the  relation  3TATE1  J STATE2  holds. 

Since  all  the  above  relations  are  non-reflexive,  we  also  need  a set  of 
relations  for  right  to  left  operations:  LJ  for  Left  Jump,  UB  for  Un-Begin, 
UC  for  Un-Complete,  LW  for  Left  terminal  consuming,  and  LP  for  Left  Phrase 
consuming.  We  also  use  the  Kleene  star  to  indicate  0 or  more  occurrences 
of  a relation.  For  example,  the  relation  (STATE1  J*  STATES)  is  true  if 
STATE!  is  the  same  as  STATE?  or  if  there  exists  some  path  made  up 
exclusively  of  jump  arcs  between  STATE!  and  3TATE2. 

We  can  also  concatenate  a set  of  these  basic  relationships  to  describe 
a path  over  more  than  one  type  of  arc.  (The  only  restriction  is  that  they 
all  have  the  same  directionality).  Thus,  the  relation  (STATE1  J*BJ* 
STATE?)  is  true  if  (!)  there  is  some  path  from  STATE1  to  STATEx  that 
follows  0 or  more  JUMP  arcs  (STAi'EI  J*  STATEx);  (2)  there  is  a PUSH  arc  in 
STATEx  which  has  STATEy,  the  beginning  of  the  lower  level  transition  net, 
as  its  label  (STATEx  B STATEy);  and  (3)  there  is  some  path  of  0 >r  more 
JUMP  arcs  from  STATEy  to  3TATE2  (STATEy  J»  STATE?). 


-k- 


PUN  Wcport  No-  3^3U  - Vol . IV 


Bolt.  Beranek  and  Newman  Inc. 


We  will  use  these  relations  to  describe  sets  of  states  to  be 
considered  by  various  parts  of  the  algorithm.  The  J*  set  of  3TATE1  is  the 
set  of  all  those  states  for  which  the  relation  (STATE1  J*  STATEn)  is  true. 
If  one  is  looking  at  a state  STATE1  which  has  one  or  more  PUSH  arcs,  and 
one  wants  to  know  what  states  at  the  next  lower  level  might  have  an  arc 
that  could  consume  the  nc^  word  of  input,  the  BJ*  set  of  STATE!  would  be 
that  set  of  states. 

A few  more  comments  about  the  relations  are  in  order.  The  relations  C 
and  its  inverse  !JB  are  somewhat  more  complicated  than  just  following  an 
arc,  since  a POP  arc  does  not  of  itself  indicate  the  next  state  at  the 
higher  level  in  the  grammar.  The  relation  (STATE1  C STATE2)  is  true  if 
there  is  a PUSH  arc  which  has  STATEz  as  its  TO  state,  and  some  STATEx  as 

its  label,  i.e.  (PUSH  STATEx  <action>+  (TO  STATEz)),  and  there  is  some 

path  from  STATEx  to  STATE  1 at  the  same  level  in  the  grammar.  Since  UB  is 
the  inverse  of  C,  ( STATE1  C STATE2)  implies  ( STATE2  UB  STATED  and  vice 
versa . 

Once  we  think  of  these  relations  as  strings,  we  can  have  other 
connections  between  them  than  concatenation.  For  example,  to  describe  a 
path  across  a single  transition  network  of  the  grammar  we  can  say  (STATE1 
(W  or  J or  P)*  STATE2)  or,  as  a regular  expression,  (STATE1  (W+J+P)* 

STATE2) . Using  this  notation  we  can  now  name  seme  relations  that  are 

important  to  the  HWIM  parser.  These  are  B!  and  C!  for  left  to  right  and 
UC!  and  UB!  for  right  to  left. 

B!  = B ((J  + B)»  B )• 

C!  = C ( ( J + O*  C)»  ((J  + B)*  B)  • 

UB!  = UB  ( ( LJ  + UB)*  Ub)*  ( ( LJ  + UC)*  UC)* 

UC!  = UC  ((LJ  «-  UC)*  UC)* 

Even  more  useful  than  these  relations  arc  the  relations  J*B!J*, 
J*  C ! J* , U*UB!LJ*  and  LJ*UC!LJ*.  These  can  be  fairly  easily  described  in 
English  as  the  paths  that  will  reach  all  possible  states  on  other  levels  of 
the  grammar  that  can  be  reached  using  only  non-consuming  arcs.  The  utility 
of  these  indices  is  fairly  clear;  if  there  is  a segment  configuration 

ending  in  STATE!,  and  a new  word  is  to  be  added  to  the  island  that  can  be 
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consumed  Ly  an  arc  that  begins  at  STATE2,  then  only  if  (STATE1  ( J*  + 

+ J*C!J*)  STATE2)  is  true,  can  there  be  a path  that  will  connect  the  island 

to  the  new  wo: d . 

There  are  two  reasot  s why  we  break  the  relation  down  at  the  J*,  B! 

etc.  level.  First,  we  usually  treat  the  J*,  J* B ! J* , and  J*C!J*  cases 

separately,  and  secondly,  there  is  an  enormous  space  savings  to  storing 
only  the  B'  and  C!  sets  rather  than  the  J*B!J*  and  J*C!J*  sets,  which  are 
very  easily  computable  from  the  individual  .)*,  B!  and  C!  sets.  The  use  of 
these  sets  will  be  discussed  in  the  section  describing  the  parsing 
algorithm  (Sec.  5). 

3.  Parser  Data  Structures 

a)  States  of  the  Parsing  Proce ss 
i • Configurations 

In  order  to  handle  all  of  the  requirements  placed  on  the  Syntactic 
component,  it  rias  proved  useful  to  rethink  the  notion  - f a "state  of  the 
parsing  process".  In  a standard  left-to-right  ATN  pars*-,  , such  3 state  is 
described  by  a configuration . containing  (at  least)  a pointer  into  the 
input,  the  current  stat  of  the  grammar,  the  current  register  settings,  and 
a stack  of  next  states  and  register  settings  for  the  higher  levels  of  the 
computation.  However,  this  notion  of  a configuration  is  inadequate  for  the 
HWIM  parser.  First,  since  there  is  no  fixed  "input  string",  there  is  no 
way  to  have  a pointer  into  it.  Secondly,  both  ends  of  the  island  must  be 
indicated,  since  an  island  need  not  start  at  the  left  end.  These  facts 
also  invalidate  the  concept  of  a real  stack  on  which  all  higher  levels  of 
the  compi  ation  are  stored,  since  higher  levels  cannot  be  known  with 
certainty  unless  the  computation  begins  a the  left  end. 

These  problems  are  solved  by  redefin'ig  a configuration,  separating  it 
into  two  parts.  The  irst  part  is  called  an  island  configuration  (ICONFIG) 
and  contains  information  internal  to  the  island,  including  Its  word  matches 
and  an  indication  of  whether  the  island  reaches  either  end  of  the 
utterance.  The  second  part  contains  a list  of  segment  configurations 
(SCOiJFIGs),  which  represent  the  state  of  the  computation  for  each  level  of 
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the  grammar  in  which  any  word  of  the  island  can  be  consumed  A segment 
configuration  is  made  up  of  a left  boundary,  a right  boundary,  a .left 
state,  a right  state,  a list  of  the  register  settings  already  made  in  this 
level  of  the  parse,  a list  of  arc  actions  being  held  until  the  n ssary 

context  is  complete,  and  a list  of  states  that  have  been  passed  through  on 
this  level.  When  an  3C0NFIG  covers  a complete  traversal  of  one  level  of 
the  grammar,  a new  constituent  can  be  built  for  use  by  PUSH  arcs  at  the 
next  higher  level  of  the  grammar.  The  grouping  of  SCONFIGs  under  an 
ICONFIG  alleviates  the  need  for  a complete  stack  or  left  context. 

ii . Paths 

Two  adjacent  segment  conf igurations  (referred  to  for  brevity  as 
"segments”  in  the  subsequent  discussion)  will  be  said  to  be  compatible  if 
the  left  one  stands  in  the  (J*B!J*  + J* C ! J* ) relation  to  the  right  one 
(i.e.,  the  right  state  of  the  left  one  stands  in  that  relation  to  the  left 
state  of  the  right  one).  That  is,  two  adjacent  segments  are  compatible  if 
the  left  one  either  pushes  or  pops,  perhaps  indirectly,  to  the  right  one. 
A sequence  of  compatible  adjacent  segments  will  be  called  a path  provided 
that  there  is  no  subsequence  of  three  segments  such  that  the  first  stands 
in  the  J* B ! J*  relation  to  the  second,  which  in  turn  stands  in  the  J*C!J* 

relation  to  the  third.  (Whenever  such  a compatible  sequence  of  three 

segments  exists,  the  middle  one  can  be  completed  and  consumed  by  a PUSH  arc 
>.nat  incorporates  it  into  either  the  left  or  right  segment  or  perhaps 
combines  all  three  into  a single  segment.)  Associated  with  each  path  is  an 
indication  of  which  segment  is  at  a higher  level  in  the  grammar  than  any 
other.  Note  that  it  is  possible  to  have  two  paths  that  are  identical 
except  for  which  segment  is  chosen  as  the  top.  Marking  the  top  segment  of 
a path  divides  the  remaining  ones  into  two  groups,  which  must  then  be  at 
successively  lower  levels  as  they  get  further  away  from  the  top. 

Paths  are  only  necessary  for  making  predictions  off  an  island.  Since 
they  can  be  computed  from  an  island  configuration  and  a set  of  segment 
configurations,  it  is  not  necessary  to  make  them  permanent  data  structures. 
However  the  concepts  of  path  and  top  segment  are  very  helpful  when 

considering  how  to  connect  segments  together,  as  well  as  for  making 

predictions. 
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b ) jA  Middle-Out  V lew  of  an  ATN  Grammar 

In  the  original  presentation  of  AIN  grammars  [Woods,  1970],  a state 
was  viewed  as  the  collection  of  arcs  leading  out  of  it,  and  an  arc  was 
associated  with  its  begin-state.  While  this  conceptualization  is  adequate 
for  left  to  right  parsing,  if  we  try  to  use  it  from  right  to  left,  it 
becomes  necessary  to  search  all  states  to  find  those  arcs  that  can  reach  a 
given  state  from  the  left.  This  is  clearly  both  conceptually  awkward  and 
computationally  inefficient. 

For  UWIM's  parser,  it  is  more  useful  to  think  of  an  arc  as  a 
connection  between  two  states  that  can  be  traversed  in  either  direction. 
An  arc  then  is  associated  with  a left  state,  a right  state,  a type  (WRD, 
CAT,  etc.),  a label,  a set  of  context-free  actions  that  can  be  done  when 
the  arc  i first  encountered  regardless  of  the  direction  or  context,  and  a 
set  of  context  sensitive  actions,  which  will  be  deferred  until  adequate 
left  context  is  available.  A state  then  has  two  associated  collections  of 
arcs,  one  set  leading  to  the  left  and  the  other  leading  to  the  right.  In 

addition,  each  state  has  several  sets  of  other  related  states  associated 
with  it  (discussed  in  Sec.  3.d)  that  are  used  to  facilitate  the  parsing 
process. 

To  the  grammar  writer,  the  ATN  remains  basically  a left  to  right 
machine.  This  means  that  its  arc  actions  can  be  written  almost  as  if  the 
parser  were  operating  only  left  to  right.  The  grammar  has  actions  on  arcs 
where  they  should  be  executed  if  the  entire  appropriate  left  context  were 
set.  As  in  standard  lef t-to-right  grammars,  in  no  case  is  it  ever 
necessary  to  worry  about  the  right  context.  To  deal  with  actions  that 
require  a specific  left  context,  the  HWIM  parser  has  a mechanism  called 
"scoping”,  which  will  be  discussed  in  the  next  section. 

c ) .Scoping 

The  scoping  mechanism  allows  a grammar  that  was  created  from  a 
left-to-right  viewpoint  to  be  used  in  parsing  from  right  to  left.  During 
normal  left-to-right  processing  in  a standard  ATN  parser,  actions  call  for 
both  accessing  and  changing  the  contents  of  registers.  An  arc  action  in 
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the  grammar  has  a left-to-right  dependency  whe.i  it  either  requires  the 

value  of  a register  that  is  set  somewhere  to  the  left  or  changes  the  value 
of  a register  that  is  used  somewhere  to  the  left.  In  the  first  case, 

executing  the  action  without  the  left  context  could  not  produce  the  right 
effect,  while  in  the  second  case,  executing  it  prematurely  in  a 
right-to-left  parsing  would  cause  the  later  execution  of  the  arc  action  to 

the  left  to  get  the  wrong  value.  Such  arc  actions  are  referred  to  as 

context  sensitive . Fortunately,  fewer  than  half  of  the  actions  in  our  ATN 

grammars  turn  out  to  have  such  dependencies. 

By  analysis  of  the  paths  through  the  grammar,  it  is  possible  to 

determine,  for  each  context  sensitive  arc  action,  a set  of  states  having 
the  property  that  the  action  can  be  3afely  done  if  its  execution  ir.  a 

right-to-left  parse  is  delayed  until  the  parse  has  passed  through  one  of 

those  states.  We  refer  to  this  set  of  states  as  the  scope  of  the  action, 
an  indication  of  how  far  left  in  the  grammar  its  left-to-right  dependency 
extends.  The  HW^M  parser  contains  a mechanism  for  interpreting  explicit 
scope  declarations  on  arc  actions  by  saving  the  action  with  its  local 
context  (the  arc  on  which  it  occurred  and  the  input  being  considered  at  the 
time)  until  its  scope  is  satisfied  (if  it  is  not  already  satisfied  when  the 
action  is  first  considered). 

Two  special  cases  of  scope  declarations  are  also  included:  Scope  T is 

used  to  indicate  an  action  whose  execution  is  not  to  happen  until  the  left 
end  of  the  constituent  being  parsed  is  completed.  This  scope  indication  is 
necessary  for  actions  on  self-looping  paths  through  the  start  state  of  some 
level  of  the  network,  where  there  is  no  3tate  in  the  grammar  that  is 
guaranteed  to  be  left  of  all  dependencies.  Scope  NIL  is  used  to  indicate 
that  the  scope  of  an  action  is  entirely  local  - i.e.,  the  action  is  context 
free.  This  is  the  default  interpretation  for  any  action  that  is  not 
explicitly  marked  for  scope,  but  it  is  occasionally  useful  to  mark  it 
explicitly.  This  is  true  when  it  has  an  apparent  dependency  on  another 
register  but  analysis  of  all  preceding  paths  indicates  that  the  register  in 
question  cannot  be  set  anywhere  to  the  left.  In  this  case,  the  explicit 
NIL  scope  reminds  the  grammar  designer  that  the  question  of  scoping  has 
been  considered  for  this  action  and  that  the  answer  is  NIL. 
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The  format  of  a scope  statement  is  given  in  Figure  1 and  repeated 

here : 

(SCOPE  <scope-spec>  <action  1>  <action  1>#) 

<3cope-spec>  ->  (<state-name>  < state-name># ) ! NIL  ! T 

The  scope-spec  is  either  a list  of  state  names,  T,  or  NIL.  If  the  scope 
spec  is  a list  of  state  names,  then  the  action(s)  can  be  executed  when  the 
parse  begins  at  or  has  passed  through  any  state  in  that  list.  If  the 
scope-spec  is  T,  the  action  is  not  to  be  executed  until  the  parse  has 
hypothesized  the  left  and  for  that  level  of  the  grammar. 

An  example  of  scoping  occurs  in  the  state  S/WHAT-IS-IN-BUDGET . 

( S/ WHAT- IS- IN -BUDGET 

(JUMP  S/WHAT- I3-BUDGETED) 

(JUMP  S/POP 

(SCOPE  (S/WHAT-DO  S/WHAT-HAVE) 

(VERIFY  (GETR  LEFT)) 

(SETR  SUBJ  ( NPBUILD) ) ) ) ) 

(The  location  of  this  state  in  the  grammar  is  given  schematically  in  Figure 
2.)  From  this  state,  it  is  possible  to  jump  to  S/POP  to  end  the  utterance 
only  if  we  have  a question  such  as  "WHAT  IS  LEFT  IN  TOE  BUDGET."  In  these 
cases  the  register  LEFT  will  have  been  set  earlier.  If  we  are  parsing 
right  to  left,  we  must  know  how  far  left  we  must  carry  the  parse  before 
making  this  test.  The  JUMP  arc  to  S/POP  indicates  in  the  scope  of  the 
VERIFY  action  that  the  parse  must  have  passed  through  either  S/WHAT-DO  or 
S/WHAT-HAVE. 

It  is  possible  to  construct  an  algorithm  to  compute  the  scopes 
automatically  if  the  dependencies  of  the  arc  actions  are  explicitly  marked, 
but  we  have  not  yet  implemented  such  a facility. 

d)  Storing  the  Grammar  for  the  HWIM  Parser 

As  demonstrated  in  the  LUNAR  system  [Woods  et  al . , 1972],  it  is 

possible  to  run  a parser  by  directly  interpreting  the  ATN  syntax  as 
described  above.  In  the  SPARSER  system  [Bates,  1975b],  the  grammar  is  also 
used  as  given,  augmented  by  a set  of  extra  indices  connecting  the  grammar 
from  right  to  left  and  relating  the  different  transition  networks  together 
in  a helpful  manner.  In  the  SOPHIE  system  [Burton,  1976;  Burton  and  Wood3, 
1976]  on  the  ether  hand,  the  ATN  grammar  has  been  "compiled"  for  speed  and 
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efficiency  into  a single  LISP  function,  though  it  still  operates  from  left 
to  right.  As  our  desire  was  both  to  gain  speed  and  efficiency  as  wpII  as 
to  permit  bi-directional  parsing,  the  new  HWIM  parser  uses  techniques  from 
both  SPAhSEH  and  SOPHIE.  It  pre-processes  the  grammar,  smashing  it  into 
several  arrays  that  describe  the  grammar  in  both  directions.  The  arc 
arrays  contain  the  left  and  right  states  of  each  arc  in  addition  to  the 
type,  label,  context  free  actions,  ar.d  context  sensitive  actions  associated 
with  that  arc.  The  state  arrays  contain  the  state  name,  left  and  right 
going  arcs,  and  several  precomputed  state  relation  sets.  The  sets 
currently  stored  are  J*,  B,  C,  B!  and  C!  and  their  right-to-left 
counterparts.  These  sets  are  important  to  the  parsing  algorithm  in  the  way 
that  the  L*  set  used  by  Earley  [ 1970]  is  in  his  de scripti’-n  of  an  efficient 
context-free  parsing  algorithm.  Their  use  is  described  m detail  below. 

4 . Syncax-Control  Interface 

a ) The  Nature  of  the  Interface 

In  the  introduction  to  this  section  we  briefly  discussed  the  four 
basic  functions  filled  by  the  Syntactic  component  in  HWIM'S  control 
strategy.  Here  we  will  describe  its  interface  with  Control. 

In  the  current  implementation  of  HWIM,  the  Syntactic  and  Control 
components  inhabit  separate  TENEX  forks,  using  a shared  file  for 
communication.  In  order  to  isolate  the  implementation  specific  details  of 
the  interface,  Control  is  provided  with  a set  of  functions  that  have  the 
same  names  and  I/O  characteristics  as  the  top  level  functions  in  the  Syntax 
fork  and  handle  the  task  of  calling  their  counterparts  there. 

b ) High-1 evel  Par ser  Functions 

The  top  level  parser  functions  are  designed  to  operate  on  only  one  new 
word  at  a time.  That  is,  the  parser  can  either  consider  an  initial  one 
word  ("seed")  island  or  add  one  new  word  to  an  existing  island.  This 
simplicity  holds,  even  though  HWIM's  control  strategies  have  become  quite 
complex  (see  Vol.  3i  Sec.  E).  The  complexity  at  the  control  level  lies  in 
the  selection  of  the  next  word  to  consider  and  the  number  of  seeds  in 
contention  at  the  same  time. 
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The  following  is  a brief  description  of  the  major  high-level  parser 
f unctions: 

SYNSEED  is  the  function  used  to  create  an  island,  given  a new  word 
match  and  the  theory  number  to  be  assigned  to  the  new  island.  SYNSEED 
calls  the  function  STARTISLAND,  which  creates  all  possible  segments  for  the 
word  match. 

SYNEVENT  adds  a new  word  to  an  existing  island,  given  its  associated 
theory  number,  the  new  theory  number  to  be  assigned,  the  new  word  match  to 
be  added,  and  the  side  onto  which  the  new  word  is  to  be  added. 

SYNEND  takes  an  old  theory,  a new  theory,  and  a direction  and  causes 
the  old  theory's  island  to  be  connected  to  the  initial  state  or  a final 
state  of  the  grammar  depending  on  the  direction.  SYNEND  is  necessary 
because  islands  are  normally  only  extended  through  their  final  consuming 
arcs  at  either  end,  and  the  final  non-consuming  arcs  necessary  for 
completion  are  not  done. 

SYNEVENT  and  SYNEND  can  call  CONNECT-LEFT  and  CONNECT-RIGHT  to  connect 
an  arc  or  a state  to  an  existing  island  in  all  possible  ways.  CONNECT-LEFT, 
CONNECT-RIGHT,  and  STARTISLAND  make  up  the  guts  of  the  parser  algorithm  and 
will  be  discussed  in  Sec.  5. 

The  function  SYNCOLLISION  joins  two  existing  islands  with  a new  word 
added  between  them.  It  takes  tt.e  theory  numbers  of  the  two  existing 
islands,  the  new  theory  number  to  be  assigned  to  the  result,  and  the  new 
word  match  that  connects  the  two  islands.  Though  it  appears  to  violate  the 
parser's  "one-word-at-a-time"  processing,  it  is  in  fact  implemented  by 
taking  the  larger  of  the  two  islands  and  adding  to  it  both  the  new  word  and 
the  words  from  the  other  island  one  at  a time.  SYNCOLLISION  is  provided  to 
Control  for  convenience  and  efficiency. 

SYNSEED,  SYNEVENT,  SYNEND,  and  SYNCOLLISION  all  produce  the  same 
result.  They  return  a list  consisting  of: 

1)  a flag  indicating  if  there  were  any  possible  parses  or  a complete 

parse  for  the  island, 

2)  the  score  of  the  best  path  through  the  island, 

3)  the  parse  tree  if  there  was  a complete  parse, 
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4)  a list  ol'  categories  that  could  be  adjacent  to  the  island  on  the  left, 
‘j ) a list  of  words  that  could  be  similarly  adjacent, 

b)  a list  of  categories  that  could  be  adjacent  to  the  island  on  the 

right,  and 

7)  a list  of  words  that  could  be  on  the  right. 
b • The  Principal  Parser  Funct ion s 

In  this  section  we  will  describe  the  main  functions  in  the  HWIM 
parser,  as  well  as  give  a fairly  complete  step  by  step  description  of  each. 
The  step  by  step  descriptions  will  give  some  idea  of  how  the  parser 
accesses  information  and  groups  its  computations.  The  functions  described 
are  all  suffixed  -RIGHT  to  indicate  that  they  are  used  when  adding  a new 
word  to  the  right  of  an  existing  island.  There  is  a set  of  similar 
functions  (suffixed  -LEFT) . which  work  on  adding  words  to  the  left.  The 
differences  between  the  sets  are  small,  brought  on  by  the  fact  that  a new 
level  is  begun  at  one  particular  state  on  the  left  (the  label  of  the  PUSH 
arc)  but  can  pop  from  any  of  a number  of  accepting  states  (states  with  POP 
arcs)  on  the  right.  These  differences  usually  involve  only  an  extra  loop 
to  be  executed  a bounded  number  of  times. 

a)  Starting  an  Island 

To  begin  a one-word  island,  the  parser  finds  all  arcs  in  the  grammar 
that  can  consume  the  new  word  match,  using  an  index  of  pointers  from  each 
word  and  category  to  the  arcs  that  consume  them.  It  processes  those  arcs, 
creating  a segment  configuration  (SCONFIG)  for  each  one.  These  are 
;ollected  into  an  island  configuration  (ICONFIG),  which  is  then  processed 
to  find  the  proposals  to  return  to  Control.  The  proposal  mechanism  will  be 
discussed  Section  5.c.v. 

b)  Processing  an  Arc 

The  basic  transition  function  in  the  parser,  DOARC,  takes  an  old 
SCONFTr;,  an  adjacent  arc,  and,  if  it  is  a consuming  arc,  a word  or 
constituent  to  be  consumed.  The  result  is  a new  SCONFIG  representing  the 
state  of  the  computation  after  the  new  arc  and  input  have  been 
incorporated.  DOARC  is  applied  to  a list  of  SCONFIGs  to  be  processed  for  a 
given  arc  and  direction.  The  result  is  a new  list  of  SCONFIGs. 
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DOARC(SCONFIGLIST  ARC  WORD/CONSTIT  DIRECTION): 

1)  For  each  SCONFIG  do  2 through  N 

2)  Verify  that  the  arc  is  adjacent  to  the  old  SCONFIG; 

3)  Set  up  context  for  arc  actions 

- set  atom  REGS  to  register  list  m SCONFIG 

- set  atoms  LEX,  *,  and  FEATURES  according  to  the 
input  being  consumed; 

4)  Do  the  context  free  actions  on  the  arc; 

5)  For  each  context  sensitive  action  on  the  arc: 

- if  the  context  is  not  yet  complete,  create  an  undone 
scoped  action  to  be  added  to  the  new  SCONFIG 

- otherwise  evaluate  the  action. 

6)  If  the  direction  is  to  the  left,  see  if  the  scope  of 

any  saved  actions  in  the  old  SCONFIG  has  now  Been  satisfied. 
If  so,  evaluate  these  actions. 

7)  Build  up  the  new  SCONfIG  by 

- setting  the  new  boundaries  and  end  states 

- 'aving  the  registers  and  undone  scoped  actions 

- adding  the  left  state  of  the  arc  to  the  list  of 
states  that  the  computation  has  passed  through. 


c ) Connecting  a New  Word  to  an  I sland 


This  section  describes  the  process  of  adding  a new  word  to  the  right 
of  an  existing  island.  Since  both  connecting  a new  word  on  the  left  of  an 
island  and  connecting  to  either  end  of  the  grammar  are  very  similar  to 
this,  we  will  not  describe  them  here. 


In  extending  a segment  at  the  end  of  an  old  island  to  meet  a new  word 
to  be  added,  the  sequence  of  intervening  non-ccn suming  arcs  that  make  the 
connection  can  be  all  at  the  same  level  of  the  network  (the  J*  case);  they 
can  change  to  a lower  level  (the  J*B ! J*  case);  or  they  can  change  levels 
in  a way  equivalent  to  one  or  more  POPs  followed  by  zero  or  more  PUSHes 
(the  J*C ! J*  case).  Depending  on  the  type  of  connection,  and  the  stage  of 
the  extension,  CONNECT-RIGHT  keeps  its  partially  extended  segments 
(actually  groups  of  functionally  equivalent  segments)  in  one  of  the 
following  six  queues:  v 

1)  TODOQ  - contains  all  segments  that  have  not  changed  levels  at  all 
Initially  this  is  all  right  boundary  segments  from  the  old  island. 
Later  when  a segment  from  the  old  island  is  expended  to  include  a new 
constituent,  it  is  placed  in  the  TODOQ. 

2)  C! Q - contains  all  segments  that  have  changed  to  a higher  level  and 
may  now  only  stay  at  that  level  or  go  down  into  a new  constituent. 

3)  C! OPENLEFTQ  - contains  those  segments  from  the  C!Q  that  are  incomplete 
on  the  left  (i.e.,  have  not  been  pushed  for). 
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4 ) C ! COMP^ETEQ  - contains  the  remainder  of  the  C!0,  those  segments  that 
are  now  complete  3nd  are  ready  to  form  hole  constituents. 

5)  B ! Q - contains  those  segments  that  have  changed  to  a lower  level  and 
therefore  must  not  change  levels  again. 

t> ) DONEQ  - contains  those  segments  whose  right  end  state  has  a consuming 
are  for  the  word  being  added  to  the  island. 


i De script  ion  of  CONNECT-RIGHT 


The  function  CONNECT-RIGHT  takes  SCONFIGs  that  terminate  at  the  right 
boundary  of  an  existing  island  and  joins  them  to  a set  of  arcs  that  can 
consume  the  new  word  to  be  added  to  the  island. 


CONNECT-RIGHT(SCONFIG- GROUP-LIST  TOARCS  WORD): 

1)  Create  a set  of  J*TOSTATES,  which  are  the  states  that  "an  J*  to  the 
left  state  of  at  least  one  of  the  consuming  ares  for  the  new  word. 

2)  Put  all  groups  of  SCONFIGs  that  are  at  the  right  boundary  of  the  old 
I CONFIG  into  the  TODOQ . 

3)  If  there  is  anything  in  the  TODOQ,  call  EXTEND- PATHS- RIGHT  with  the 
whole  TODOQ  and  number  of  "directions"  set  to  3.  (See  See.  5-c.il  for 
a discussion  of  EXTEND-PATHS-RIGHT  and  the  notion  of  "direction".) 

4)  If  there  is  anything  in  the  C!Q,  call  SPLITGJQ  to  separate  it. 

5)  If  there  is  anything  in  „he  CICOMPLETEQ,  call  COMPLETE-RIGHT  to  create 
the  constituents  and  process  the  PUSH  arcs  that  can  use  them.  Then  go 
back  to  3- 

6)  If  there  is  anything  in  the  C! QPENLEFTQ  then  eall  EXTEND-PATHS-RIGHT 
on  the  whole  CIOPENLEFTQ  with  number  of  directions  set  to  2. 

7)  If  there  is  anything  in  the  B!Q  then  eall  EXTEND-PATHS-RIGHT  on  the 
B!Q  with  the  number  of  directions  (TRYDIRS)  set  to  1. 

6 > Call  DOARC  (see  Sec.  5.b)  for  eaeh  consuming  are  and  the  appropriate 
group  of  SCONFIG  groups  in  the  DONEQ. 

9)  Collect  the  results  of  the  DOARC  calls  together  to  form  the  new 
I CONFIG. 


ii.  Description  of  tXTENP- PATHS- RIGHT 


EXTEND-PATHS-RIGHT  causes  jumps  to  states  that  either  begin  a 
consuming  arc,  push  for  a constituent  that  contains  the  consuming  arc  or 
pop  from  the  eurrent  constituent  to  a higher  level  that  contains  (or  ean 
then  push  for)  the  consuming  arc.  EXTEND-PATHS-RIGHT  begins  from  an 

SCONFIG  group  whose  members  share  the  same  right  boundary.  It  completes 
the  JUMP  paths  on  that  level  that  are  required  to  reach  the  states 
mentioned  above. 
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EXTEND-PATHS-HIGHT  is  used  in  several  contexts  in  CONNECT-RIGHT  (see 
Sec.  5.c.i),  so  it  must  have  a way  of  deciding  which  of  the  above  states 

are  relevant.  To  do  this  it  must  be  supplied  with  a number  indicating  how 

many  directions  to  look  for  a connection.  If  the  number  of  directions 

(TRYDIRS)  is  1,  then  EXTEND- PATHS-RIGHT  only  considers  states  on  the  same 
level  as  the  input  SCONFIGs,  i.e.,  the  states  in  the  union  of  the  J*  sets 
of  the  right  end  states  of  the  input  SCONFIGs.  If  any  of  those  states  is 
the  left  state  of  a consuming  arc  (i.e.,  a TOSTATE) , then  all  possible  Jump 
paths  to  reach  that  state  are  executed  using  DOAHC  (Sec.  5.b).  The 

resulting  SCONFIGs  are  collected  into  a group  that  is  associated  with  the 
TOSTATE  and  placed  in  the  DONEQ. 

If  the  value  of  TRYDIRS  is  2,  then  EXTEND-PATHS-RIGHT  considers  paths 
that  lead  to  a lower  level  constituent,  as  well  as  ones  on  the  same  level 
as  the  input  SCONFIGs.  Each  of  the  states  in  the  union  of  the  J*  sets  is 
also  tested  to  see  if  it  can  reach  a state  in  J*TOSTATES  via  a B! 
transition.  If  so,  then  all  jump  paths  are  completed  from  the  input 
SCONFIGs  to  that  state,  and  the  resulting  grouD  of  SCONFIGs  is  placed  in 
the  BIO 

If  the  number  of  directions  (TRYDIRS)  is  3,  EXTEND-PATHS-RIGHT  also 
considers  paths  that  complete  the  segments  of  the  input  SCONFIGs  and  return 
to  a higher  level.  These  higher  levels  may  contain  one  of  the  consuming 
arcs  or  may  then  push  for  new  constituents  that  contain  a consuming  arc. 
Note  that  these  paths  are  just  the  C!  set  described  in  Section  2.b.  For 
each  state  that  can  reach  a state  in  J*TOSTATES  via  a C!  transition,  all 
jump  paths  from  the  input  SCONFIGs  are  created,  and  the  resulting  group  of 
SCONFIGs  is  placed  in  the  C!Q. 


EXTEND- PATHS- RIGHT (QUEUE  TRYDIRS) : 

1)  Compute  the  set  of  FROMSTATES  from  the  union  of  the  J* 
right  end  states  of  the  input  SCONFIGs  in  the  QUEUE. 


sets  of  the 


2)  Compute  J*TOSTATES  from  the  union  of  the  LJ*  sets  of  the  left  states 
of  the  TOARCS  (the  consuming  arcs  for  the  word  being  added  in  the 
event) . 

3)  For  each  state  in  FROMSTATES  do  4 through  6. 

14)  If  FROMSTATE  is  a left  state  of  any  TOARC.  then  make  all  Jump  paths 
' leading  to  FROMSTATE  from  any  of  the  Input  SCONFIGs.  This  is  done 
with  the  function  GETI'ATHS-RIGHT , which  builds  up  jump  paths  and  keeps 


-17- 


BUN  Report  No.  - Voi . IV 


Bolt  Beranek  and  Newman  Inc. 


a table  of  states  t.idt  have  been  connected  to  the  input  SCONFIGs.  In 
this  way,  the  JUMP  arcs  are  executed  when  they  are  needed.  If  they 
are  needed  again,  the  previous  results  are  used. 

5)  If  TRYDIRS  is  2 or  3 and  (FROMSTATE  B!  J*T0STATE)  is  true  for  any 
state  in  J*T0S'f ATfcS,  then  GETPATHS-RIGHT  is  called  for  FROMSTATE  and 
the  resulting  SCONFIGs  are  placed  in  the  B!Q. 

b)  If  TRYDIRS  is  3 and  (FROMSTATE  C!  J* TOST ATE)  is  true  for  any  state  in 
J* TOSTATE  S,  then  GETPATHS-RIGH7  is  executed  for  FROMSTATE  and  the 
resulting  SCONFIGs  are  placed  in  the  C!Q. 

iii . Description  of  3PLITC1Q 

The  C!Q  queue  contains  all  segments  that  have  reached  an  accepting 
state  for  the  input  SCONFIGs.  However,  there  is  a major  difference  between 
those  of  its  segments  which  a. e now  complete  at  both  ends  and  those  which 
are  still  incomplete  at  one  end.  In  the  first  case,  the  appropriate 
constituent  must  be  created  and  the  process  resumed  at  the  higher  level 
(COMPLETE-RIGHT).  In  the  second  ca^e,  however,  all  states  in  the  C!  set  of 
the  right  state  of  the  segment  must  still  be  considered  for  extension, 
since  the  segment  has  no  left  context. 

SPLlTCfQ  checks  the  left  boundary  of  each  segment.  If  it  is  the  left 
boundary  of  the  island,  then  the  segment  is  clearly  open  and  is  put  in  the 
C ! OPENLEFTQ . If  it  is  within  the  island,  then  SPLITCiQ  picks  up  ESTATES , 
the  list  of  right  end  states,  from  the  list  of  segments  whose  right  end 
coincides  with  the  left  end  of  the  segment  being  considered.  This  is  done 
by  keeping  an  index  of  segments  in  the  island  by  right  boundary.  If  there 
is  an  intersection  between  the  UB!  set  of  the  left  end  state  of  the  SCONFIG 
in  the  C!Q  and  ESTATES,  then  the  SCONFIG  could  have  been  pushed  for  from 
one  of  those  segments,  so  the  SCONFIG  is  put  in  the  C ! COMPLETE} . If  there 
is  an  intersection  between  the  UC!  set  of  left  end  states  of  the  SCONFIG 
and  ESTATES,  then  it  is  possible  for  the  SCONFIG  to  belong  to  a path  in 
which  it  is  the  highest  level  SCONFIG.  It  is  therefore  put  in  the 

CiOPENLEFTQ.  It  is  possible  for  an  SCONFIG  to  be  in  both  queues  as  a 
result  of  different  possible  paths.  If  an  SCONFIG  has  been  put  in  neither 
queue  though,  a processing  error  has  occurred,  in  that  the  island  did  not 
contain  a proper  set  of  segments. 
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SPLITCSQ: 

1)  For  every  SCONFIG  In  CIQ  do  2 through  5. 

2)  If  the  left  boundary  of  SCONFIG  (LBDRY)  is  the  same  as  the  left 
boundary  of  the  ICONFIG  being  processed,  then  put  the  SCONFIG  in  the 
C! OPENLFFTQ . Skip  3 thru  5. 

3)  Collect  a set  of  RSTATES  from  the  union  of  the  right  end  states  of  the 
segments  that  have  a right  boundary  equal  to  LBDRY. 

H)  If  the  intersection  of  RSTATES  and  the  UB!  set  of  LSTATE  (the  left  end 
state  of  SCONFIG)  is  non-empty,  then  put  SCONFIG  in  the  CICOMPLETEQ. 

5)  If  the  intersection  of  RSTATES  and  the  UC!  set  of  LSTATE  is  nor-empty, 
then  put  the  SCONFIG  in  the  C10PENLEFTQ. 

iv . Description  of  COMPLETE-RIGHT 

In  the  course  of  adding  a new  word  to  the  right  of  an  existing  island, 
COMPLETE-RIGHT  is  called  if  a segment  that  is  complete  at  its  left  end 
encounters  a final  state  on  its  right.  COMPLETE-RIGHT  creates  the  new 
constituent  to  be  popped,  joining  it  to  the  island  in  one  of  two  ways.  It 
may  add  it  to  an  existing  segment  at  tne  next  higher  level,  or  if  such  a 
segment  does  riot  exist,  it  will  create  a new  intermediate  segment  for  each 
arc  that  can  use  the  completed  constituent  and  is  compatible  with  some 
SCONFIG  of  the  left. 

Completing  the  segment  and  creating  the  new  constituent  COMPLETE-RIGHT 
causes  all  undone  scoped  actions  to  be  executed  and  creates  the  new 
constituent  from  the  label  of  the  FOP  arc.  COMPLETE-RIGHT  then  looks  to 
see  if  any  existing  segment  immediately  to  the  left  of  the  new  constituent 
could  have  pushed  for  it.  If  so,  its  PUSH  arc  is  executed  to  include  the 
new  constituent.  COMPLETE-RIGHT  al30  looks  to  see  if  any  segment  to  the 
left  of  the  new  constituent  can  push  for  it  indirectly  and  still  reach  the 
consuming  arc  for  the  event.  For  each  of  these  cases,  COMPLETE-RIGHT  must 
create  a new  segment,  extend  it  from  its  beginning  state  to  the  state  that 
pushes  for  the  new  constituent,  and  execute  that  push  arc  to  create  a new 
segment  that  is  complete  on  the  left  and  that  includes  the  new  constituent 
on  the  right. 

After  a segment  has  been  extended  to  include  the  new  constituent,  it 
is  placed  back  in  the  TODOQ  because  it  may  need  to  be  further  extended. 
The  CONNECT-RIGHT  function  will  deal  with  this  correctly  since  the  new 
segment  behaves  very  much  like  the  segments  that  were  in  the  old  island. 
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COMPLETE-RIGHT: 

1)  Select  from  the  C1C0MPLETEQ,  all  those  SCONFIGs  that  cover  the 
smallest  section  of  the  utterance  D.e.,  those  with  the  right-most 
left  boundary). 

2)  For  each  SCONFIG  selected,  do  all  remaining  undone  scoped  actions. 

3)  For  each  SCONFIG,  do  the  POP  arc  action  (saved  as  a scoped  action  with 
scope  POP)  to  create  the  constituent.  Group  the  constituents  by  start 
state.  These  groups  will  represent  ambiguous  parses  of  a constituent 
with  the  same  Input. 

4)  Find  all  SCONFIGs  in  the  old  island  that  end  at  the  same  place  the  new 
constituents  start.  Name  these  the  FUSH-SCONFIGs.  Name  the  union  of 
the  right  states  of  the  PUSH-SCONFIGs  the  PUSHSTATES. 

5)  For  each  constituent  group  formed  in  3,  do  6 through  7. 

6)  Find  all  arcs  that  can  PUSH  for  the  new  constituent  (i.e.,  the  label 
of  tne  PUSH  arc  is  the  same  as  the  start  state  of  the  constituent), 
and  satisfy  ( RSTATE  is  a member  of  TOSTATE)  or  ( RSTATE  J*B! J*  TOSTATE) 
where  RSTATE  is  the  right  state  of  the  PUSH  arc  and  TOSTATE  is  any 
state  in  TOSTATES  described  under  CONNECT-RIGHT. 

7)  For  each  such  PUSH  arc  do  8 through  9- 

8)  For  all  PUSH-SCONFIGs  whose  right  state  is  the  same  as  the  left  state 
of  the  PUSH  arc,  extend  the  segment  by  doing  the  PUSH  aro  for  each 
constituent  in  the  group,  Put  each  resulting  SCONFIG  on  the  TODOQ. 

9)  For  each  PUSH-SCCNFIG  whose  right  state  can  B!J*  to  the  left  state  of 
the  PUSH  arc  (LSTATE).  do  10  through  11. 

10)  Find  all  states  STATE  1 such  that  (RSTATE  of  PUSH-SC0NF1G  R!  STATED 
and  ( STATE  1 J*  LSTATE). 

11)  Create  a new  segment  for  each  path  from  each  STATE1  to  LSTATE.  Extend 
each  of  those  segments  by  doing  the  PUSH  arc  for  each  of  the 
constituents  in  the  group.  Put  the  resulting  SCONFIGs  on  the  TODOQ. 


v . Description  of  PREDICT-RIGHT 


PREDICT-RIGHT  creates  a list  of  all  terminal  consuming  arcs  that  can 
be  reached  by  any  path  of  non-consuming  arcs  from  the  right  end  of  an 
i3lar.d.  The  word  and  category  predictions  made  by  the  Syntactic  component 
are  collected  from  this  list  and  a corresponding  one  for  the  left  end  of 
the  island. 


It  is  not  sufficient  for  PREDICT-RIGHT  to  list  all  arcs  that  can  be 
reached  from  states  that  can  end  the  island;  these  predictions  must  be 
restricted  by  segment  configurations  within  the  island  that  are  at  a higher 
level  than  segments  on  the  right.  To  make  the  list  of  predicted  arcs, 
PREDICT-RIGHT  groups  the  SCONFIGs  by  end  states  and  boundaries  so  that 
ambiguous  parses  of  one  level  are  together.  For  each  SCONFIG  group  at  the 
right  boundary,  PREDICT-RIGHT  checks  to  see  if  it  can  reach  the  left  end  of 
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the  island  without  passing  through  a higher  level  segment.  If  so,  then  all 
consuming  arcs  in  all  states  reachable  by  (J*  + J*BiJ»  + J* C ! J* ) are 
predicted.  If  not,  PREDICT-RIGHT  divides  the  predictions  into  two  cases. 
First,  all  terminal  consuming  arcs  in  states  reachable  from  the  SCONFIG  by 
(J*  + J* B ! J* ) are  predicted.  Secondly,  for  each  SCONFIG  that  can  push  for 

the  segment  at  the  right  boundary,  PREDICT-RIGHT  repeats  the  prediction 
process  on  the  higher  level  SCONFIG  as  if  its  right  end  state  were  the 
right  state  of  the  PUSH  arc  that  pushes  for  the  segment  at  the  right 
boundary.  As  a result  of  this  process,  all  predictions  on  the  right 
compatible  with  possible  paths  across  the  island  have  been  generated. 

6 . Conclusion 

In  this  section,  we  have  attempted  to  give  a detailed  exposition  of 
hWIM's  middle-out,  bi-directional  ATN  parsing  algorithm  in  sufficient 
detail  that  it  could  be  implemented  by  other  experimenters  and  its 
properties  verified  either  theoretically  or  empirically.  We  could  have 
gone  into  more  meticulous  detail  and  attempted  to  give  a proof  of  its 
correctness,  but  space  and  time  constraints  on  this  report  have  made  that 
impossible.  We  believe  that  the  exposition  is  essentially  correct  and  hope 
that  it  is  useful  to  others. 
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B.  GRAMMARS 

1 . HI storlcal  Perspective 

The  very  first  grammar  to  be  used  as  part  of  the  BBN  speech  system 
was  a subset  of  the  LUNAR  grammar  [Woods  et  al . , 1972].  Althougn  it 
initially  comprised  only  11  states,  it  was  gradually  enlarged  until  it 
reached  83  states.  It  was  named  SPEECHGRAMMAR . 

In  adapting  ATN  grammars  for  use  in  the  uncertain  world  of  speech 
processing,  a number  of  changes  were  made  to  the  ATN  grammar  formalism. 
First,  the  test  portion  of  each  arc,  which  was  formerly  a single  LISP  form, 
was  broken  into  two  tests,  one  comprising  the  context-free  checks  that 
could  be  made  on  the  current  word  and  its  syntactic  features  and  the  other, 
the  context-sensitive  tests,  which  required  information  stored  in 
regi ster s. 

Another  change  to  the  grammar  formalism  as  the  removal  of  SENDR  and 
HOLD  actions.  This  did  not  in  any  way  reduce  the  power  of  a grammar  and 
moreover  facilitated  more  efficient  local  parsing.  As  a consequence,  a 
relative  clause  could  be  parsed  and  stored  in  a well-formed-substring 
table  (WFST)  as  a complete  constituent  in  itself,  rather  than  only  as  part 
of  some  noun  phrase.  In  such  a relative  clause  constituent , a dummy  node 
such  as  **NP#*  would  be  placed  in  the  parse  tree  to  hold  a place  for  the 
head.  This  would  then  be  replaced  with  the  appropriate  structure  on  the 
PUSH  arc  looking  for  a relative  clause.  This  method  eliminated  the  need  to 
reparse  the  relative  clause  each  time  there  were  two  or  more  alternative 
heads  posed  for  the  construction. 

The  most  important  characteristics  of  the  original  grammar  were  that 
it  processed  words  by  their  usual  parts  of  speech  and  that  it  constructed 
ordinary  syntactic  parse  trees  for  a wide  variety  of  complex  syntactic 
constructions  such  as  reduced  and  unreduced  relative  clauses,  sentential 
complements,  simple  conjunctions,  possessives,  nominal  compounds,  and 
adverbial  modifiers. 
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2 . The  F Ir st  Speech  Parser 


As  SPEECHGRAMMAR  was  being  developed,  a parser  was  also  under 
construction  that  was  quite  different  in  organization  and  operation  from 
that  of  the  LUNAR  system.  The  main  features  of  this  parser  were  the 
following: 

1)  It  was  designed  to  start  parsing  anywhere  in  the  input  stream  and  to 
parse  despite  the  lack  of  certainty  as  to  the  exact  nature  of  the 
words  at  each  point  in  the  input. 

2)  Complete  constituents,  when  found,  were  stored  in  a 
well-formed-substring  table  along  with  their  features,  boundaries,  and 
a semantic  evaluation  of  their  meaningfulness  so  that  they  might  be 
used  by  any  other  parse  path  which  needed  a constituent  of  that  type 
at  the  same  place  without  reparsing. 

3)  As  partial  parse  paths  were  built  up,  their  pieces  were  also  stored  in 
tables  so  that  any  other  parse  that  could  use  them  did  not  need  to 
reparse  common  sections  of  input. 

4)  Using  the  grammar,  the  parser  could  make  predictions  about  the  words 
or  syntax  classes  that  could  be  used  to  extend  a sequence  of  words  in 
a theory  either  to  the  right  or  to  the  left.  If  a gap  between  words 
was  small  enough  to  contain  just  one  word,  the  parser  could  predict 
just  the  class  or  classes  of  words  to  fill  the  gap. 

5)  The  control  structure  of  the  parser  could  be  modified  fairly  easily  to 
allow  experimentation  with  various  combinations  of  backup,  sequential, 
and  parallel  search.  It  used  a combination  of  depth-first  and 
breadth-first  techniques.  usually  following  a single  path  but 
splitting  into  parallel  paths  when  desirable. 

6)  Care  was  taken  to  allow  the  parser  to  interact  frequently  and  easily 
with  other  components  of  the  system  (notably  Semantics)  in  order  to 
receive  guidance  and  to  verify  completed  constituents. 

7)  Although  at  any  given  moment  the  parser  was  concerned  with  only  one 
theory,  its  data  base  contained  all  the  information  it  discovered  in 
processing  previous  theories,  thus  allowing  considerable  sharing  of 
information  without  duplicating  effort.  This  organization  allowed  for 
the  occurrence  of  some  event  (such  as  the  completion  of  a constituent) 
to  alert  the  Control  component  to  the  fact  that  certain  previously 
processed  theories  might  be  affected  by  the  event  and  should  be  queued 
for  further  processing. 


For  a complete  description  of  this  parser,  see  [Bates,  1975a;  Bates, 
1975b] . 

This  parser  depended  on  semantic  and  pragmatic  pre-selection  of  sets 
of  possible  words  from  the  word  lattice  to  cut  down  the  combinatorial 
explosion  of  possible  partial  parse  paths.  It  was  able  to  use  semantic 
guidance  during  parsing  using  functions  that  could  be  called  on  arcs  of  the 
grammar,  and  it  ordered  the  parse  paths  so  that  the  best,  i.e.,  most  likely 
according  to  some  predetermined  metric,  could  be  tried  first  using  a 
basically  top  down  algorithm  with  backup  to  less  likely  alternatives.  By 
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U3ing  a well-fo"med-substring  table  in  which  constituents  were  placed 
together  with  a score  of  semantic  well-formedness,  the  parser  was  able  to 
use  semantic  information  to  help  order  alternative  possibilities. 

3-  Text  Grammars  vs.  Speech  Grammars 

In  parsing  text,  one  may  usually  assume  that  the  input  is  grammatical 
and  the  words  are  known  precisely  and  correctly.  In  a speech  environment, 
however,  one  must  assume  that  even  if  a grammatical  sentence  is  uttered, 
there  will  be  enough  error  in  the  acoustic  and  lexical  processing  to 
produce  high  scoring  but  incorrect  (and  frequently  ungrammatical)  sequences 
of  words.  Thus,  it  is  not  enough  that  the  grammar  recognize  correct 
sentences  — it  must  also  be  able  to  identify  ungrammatical  sequences  as 
soon  as  possible  in  order  to  reject  them.  This  means  that  given  any 
sequence  of  words  within  a sentence  (not  necessarily  starting  at  the 
beginning)  the  grammar  should  predict  only  those  adjacent  words  that  would 
make  a sequence  that  could  actually  be  part  of  a grammatical  sentence.  A 
great  deal  of  time  was  spent  on  grammars  used  by  the  speech  system  to 
tighten  them  sufficiently,  a very  difficult  process  for  large  complex 
grammars. 

k.  SMALLGRAM 

As  we  gained  experience  with  the  speech  system,  several  problems 
became  apparent.  Because  of  the  way  the  parser  was  constructed,  a path 
through  a constituent  had  to  be  completed  before  tne  actions  on  the  arcs  of 
the  path  could  be  executed.  Thus  it  was  difficult  to  apply  syntactic  and 
semantic  constraints  as  early  as  one  would  have  liked.  For  example,  it  was 
impossible  to  eliminate  a sequence  like  "a  trip  for  my  recent  Chicago  trip” 
until  it  had  been  completely  parsed.  Similarly,  long  strings  of 
incompatible  adjectives  ("prst  left  available  high  expensive”)  would  be 
constructed  before  a semantic  check  could  be  made. 

To  eliminate  this  problem,  two  actions  were  taken.  One  was  to  begin 
work  on  a parser  that  could  take  more  context  into  account  as  it  parsed 
islands.  Th^s  parser  has  been  described  in  detail  in  Section  A.  The  other 
was  to  write  a new  grammar  for  the  system  which  used  "semantic"  categories 
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on  its  arcs  as  well  as  the  traditional  syntactic  ones.  The  dictionary  was 
modified  to  mark  entries  with  such  new  semantic  categories  as  CONVEYANCE 
(bus,  train),  DUHAT10N  (year,  week),  NUM-ADJ  (nearly,  almost,  under), 
SEASON  (fall,  winter),  SPONSOR  (IFIP,  ACL),  and  TRIP-ADJ  (remaining, 
European,  taken).  All  syntactic  parts  of  speech  were  retained. 

The  grammar  written  to  take  advantage  of  this  additional  information 
was  initially  called  SMALLGRAM  in  recognition  of  the  fact  that, 
intuitively,  it  was  smaller  in  scope  that  SPEECHGRAMMAR . It  contained  a 

large  proportion  of  WRD  arcs  and  used  both  syntactic  and  semantic 
categories  on  CAT  arcs.  Most  importantly,  the  set  of  constituents  pushed 
for  was  expanded  to  include  such  "semantic  constituents"  as  MEETING,  TRIP, 
PERSON,  etc. 

To  illustrate  the  qualitative  difference  between  these  two  approaches, 
Figure  1 shows  a schematic  for  simple  noun  phrases  as  were  used  in 
SPEECHGRAMMAR.  Figure  2 shows  a corresponding  grammar  fragment  such  as 
that  used  in  SMALLGRAM. 


Fig.  1.  A portion  of  a "syntactic"  ATN  grammar. 


Fig.  2.  A portion  of  a "semantic"  ATN  grammar. 
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While  the  structures  produced  by  3MALLGRAM  were  still  syntactic  parse 
trees,  they  were  guaranteed  to  be  interprctable . In  addition,  sentences 
that  were  syntactically  ambiguous  in  SPEECHGFAMMAR  (e.g.,  "The  money  was 
spent  by  John"  and  "The  money  was  spent  by  July")  were  unambiguous  in 
3MALLGRAM  because  the  semantic  information  needed  to  determine  correct 
scope  and  modifier  placement  was  built  in. 

There  are  several  drawbacks,  however,  to  a SMALLGRAM-type  semantic 
grammar.  One  is  that  the  grammar  grows  very  rapidly  as  it  is  extended  to 
handle  a larger  class  of  utterances.  In  fact,  to  handle  the  same  set  of 
meaningful  utterances  as  an  equivalent  syntactic  grammar  a much  larger 
semantic  grammar  is  required.  In  addition,  the  task-specific  nature  of  a 
semantic  grammar  means  that  it  must  be  written  anew  for  each  different 
application  domain.  It  has,  however,  a significant  advantage  for  speech 
understanding:  it  provides  a uniform  framework  for  applying  constraints 

that  take  into  account  semantic  and  pragmatic  as  well  as  syntactic 
information . 

5.  BIGGRAM 

While  3MALLGRAM  used  both  semantic  and  syntactic  information  in 
determining  and  parsing  an  utterance,  its  final  output  was  only  a syntactic 
tree.  Thus,  information  was  being  thrown  away  which  had  to  be  recovered 
later  by  semantic  interpretation  rules.  We  decided  to  eliminate  this 
middle  step  by  modifying  the  grammar  to  produce  semantic  interpretations 
directly.  This  not  only  speeded  up  the  system  but  reduced  its  size  by  an 
entire  TENEX  fork. 

The  resulting  grammar  builds  interpretations  by  accumulating  in 
registers  the  semantic  head,  quantifier,  and  links  of  the  nodes  being 
described  in  the  sentence.  For  example,  the  sentence 

"I  will  go  to  Chicago  for  the  ASA  meeting." 
yields  the  interpretation 

(FOR:  THE  X / ( FINDQ : LOCATION  (CITY  CHICAGO))  : T; 

(FOR:  THE  Y / (FINDQ:  DB/MEETING  (SPONSOR  ASA))  : T; 

(BUILDw:  DB/TRIP  DESTINATION  X) 

(TRAVELER  SPEAKER) 

TO/ATTEND  Y) 

(TIME  (AFTER  NOW)))) 
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This  interpretation  is  huiit  up  in  the  following  way.  The  PUGH  arc 
that  looks  for  a constituent  describing  a person  at  the  start  cf  the 
sentence  will  transform  the  pronoun  "I"  into  the  link-node  pair  (TRAVELER 
SPEAKER)  and  return  this  as  the  interpretation  of  that  corstituent.  The 
word  "will"  adds  (TIME  (AFTER  NOW))  to  the  list  of  link-node  pairs  being 
accumulated.  (The  grammar  does  not  accept  constructions  like  "will  have 
gone,"  so  "will"  can  currently  always  be  interpreted  as  marking  a future 
event.)  The  word  "go"  sets  the  semantic  head  to  DB/TR1P.  The  constituent 
"to  Chicago"  parses  witn  t ^ interpretation 

(DESTINATION  (!  THE  X LOCATION  (CITY  CHICAGO)))). 

(The  ! indicates  that  a FOR:  expression  will  have  to  be  built  as  part  of 
the  interpretation.)  Similarly,  "the  ASA  meeting"  produces 

(TO/ATTEND  (!  THE  Y DB/MEET1NG  ((SPONSOR  ASA)))). 

The  top  level  of  the  grammar  has  thus  accumulated  the  link-node  pairs 

(DESTINATION  (I—)) 

TIME  — ) 

(TRAVELER  SPEAKER) 

(TO/ATTEND  (!  — ))) 

with  the  semantic  head  DB/TRIP.  The  appropriate  action  (in  this  case  a 
BUILDQ:)  is  created,  and  the  necessary  quantificational  expressions  are 
expanded  around  it. 

Currently,  each  level  of  the  grammar  produces  the  regular  syntactic 
parse  tree  and  pops  the  semantic  interpretations  that  it  has  built  in 
parallel  as  a feature.  As  soon  as  the  grammar  has  been  thoroughly  checked 
out,  the  syntactic  registers  and  parse  trees  will  be  eliminated. 

h new  action  named  VERIFY  was  also  introduced  into  the  grammar.  Its 
effect  is  to  abort  an  arc  if  its  argument  evaluates  to  NIL,  which  allowed 
f He  test  component  of  every  arc  to  be  removed.  Now  the  third  and  all 
subsequent  elements  of  any  arc  are  actions,  any  of  which  may  be  a VERIFY. 

Because  the  grammar  was  also  being  enlarged  to  accommodate  a larger 
vocabulary  at  the  same  time  these  changes  were  being  made,  it  was  renamed 
B1GGRAM.  (Two  of  the  networks  in  BIGGRAM  are  pictured  in  Figures  3 and  4. 


portion  of  the  sentence-level  network. 
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The  first  recognizes  the  latter  part  of  questions  that  have  begun  with  the 
word  "what",  and  the  second  recognizes  noun  phrases  denoting  trips.  Arcs 
that  are  not  also  part  of  MIDGRAM  (see  Sec.  6 below,  have  been  indicated 
with  dashed  lines. ) 

The  interpretations  that  the  parser  builds  are  forms  in  a command 
language  that  is  described  fully  in  Volume  5 of  this  report.  Highlights 
will  be  given  here  in  order  to  discuss  the  way  in  which  the  grammar  builds 
the  interpretations.  The  most  common  form  created  by  the  parser  is 

(FOR:  quant  var  / (FINDQ:  nodetype  (link  value)...) 

: test  ; action). 

This  is  very  similar  to  semantic  interpretations  produced  by  the  LUNAR 
system  [Woods  et  al . , 1972]. 

The  function  FINDQ:  is  a generating  function  that  enumerates  nodes  of 
the  named  type  having  the  appropriate  links.  For  example,  a node 
representing  a trip  that  was  already  taken  could  be  found  by  (FINDQ: 
DB/TRIP  ( MODALITY  TAKEN)).  The  quantifier,  quant . controls  the  calling  of 
FINDQ:.  Each  value  returned  by  FINDQ:  is  bound  to  the  indicated  variable, 
var . which  may  then  be  used  in  the  test  (a  further  restriction  on  the  value 
produced  by  FINDQ:)  and  the  action ■ 

Interpretations  may  be  nested.  For  example,  "What  is  the  cost  of  my 
trip?"  has  the  interpretation 

(FOR:  THE  A0127  / (FiNDQ:  DB/TRIP  (TRAVELER  SPEAKER))  : T ; 

(FOR  THE  A0128  / (FINDQ:  DB/COST  (COST/OF  A0127))  : T ; 

(OUTPUT:  AO  1 26 ) ) ) 

Another  common  interpretation  form  is  TEST:  which  prints  "yes"  or  "no" 
depending  on  the  Boolean  value  of  its  argument.  So  "Are  we  over  our 
budget?"  has  the  interpretation 

(FOR:  THE  A0106  / (FINDQ:  DB/BUDGET  (BUDGET/FOR  SPEAKER))  : T ; 

(TEST:  (OVERBUDGET?  AOlOb))) 

In  order  to  modify  the  data  base,  there  is  a functdof.  3UILDQ:  which  is 
like  FINDQ:  except  that  it  creates  a node  rather  than  finds  one  in  the 
network.  For  example,  "Enter  two  trips"  would  parse  into 
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(FOR:  2 A0 1 38  / (BUILDQ:  DB/TRIP)  : T ; T) 

The  only  other  unusual  interpretation  form  involves  the  functions 
COUNT:  and  SETOF:  which  are  used  to  answer  "how  many"  questions.  For 
example,  "How  many  trips  were  taken?"  would  produce  the  interpretation 

(FOR:  THE  A0079  / (SETOF:  (FINDQ  DB/THIP  (TIME  (BEFORE  NOW)))) 

: T ; (OUTPUT:  (COUNT:  A0079))) 

For  most  sentences,  building  semantic  interpretations  in  registers  cn 
arcs  of  the  grammar  is  extremely  straightforward.  For  each  semantic  entity 
like  TRIP  and  BUDGET,  a FOR:  form  will  eventually  be  developed,  so  the 
information  that  must  be  accumulated  is  a quantifier,  a head  (which  will 
become  the  node  type  in  the  FINDQ:  or  BUILDQ:),  a set  of  semantic 

link-value  pairs,  and,  perhaps,  a test  and  an  indication  of  which  function 
(if  not  FINDQ:)  is  to  be  used  in  the  body  of  the  form.  The  register  names 
3EMQUANT,  SEMHEAD,  SEMLINKS,  3EMTEST,  and  SEMFORM  are  used  to  hold  these 
values.  The  register  SEMVAR  is  occasionally  used  to  hold  the  variable, 
which  is  created  via  the  LISP  function  GENSYM.  When  a constituent  has 
been  parsed,  a pseudo-interpretation  is  constructed  on  the  POP  arc.  This 
is  done  since  not  all  the  information  may  be  available  at  the  given 
constituent  level.  For  example,  in  the  sentence,  "How  many  trips  did  John 
take  in  July?",  the  initial  PUSH  for  a trip  will  find  "how  many  trips"  and 
the  information  from  the  rest  of  the  sentence  (traveler  and  when)  will  have 
to  be  added  at  the  S/  level.  A pseudo-interpretation  is  a list  of  the  form 

(!  quant  var  head  semiinks  test  function) 

The  ! identifies  this  as  a pseudo-interpretaticn.  The  quantifier, 
variable,  and  head  are  required,  but  the  links,  test,  and  function  may  be 
empty.  If  the  function  is  empty,  FINDQ:  is  inserted  in  the  final  form, 
otherwise,  the  indicated  function  (generally,  BUILDQ:)  is  used. 

In  cases  where  the  constituent  may  be  augmented  at  a higher  level,  the 
PUSH  arc  that  receives  this  pseudo-interpretation  may  take  it  apart  and  put 
the  values  back  in  registers  at  the  higher  level  so  that  they  may  be  added 
to  or  changed.  If,  instead,  the  constituent  modifies  something  else,  the 
pseudo-interpretation  will  be  put  into  a new  register  (most  likely  the 


-32- 


BBN  Report  No.  3^38  - Vol . IV 


Bolt  Beranck  and  Newman  Inn. 


SLMLINKS  register  of  the  higher  level).  For  example,  the  fragment 
"Chicago”  would  be  parsed  by  the  CITY/  network  into  (!  THE  A0021 
DB/LOCATION  ((CITY  CHICAGO))).  On  a PUSH  arc  in  the  TRIP/  network  ("a  trip 
to  Chicago"),  this  would  be  added  to  the  SEMLINKS  register  under  the  link 
DESTINATION  so  that  the  eventual  interpretation  of  the  TRIP/  network  would 
be 

( ! A0022  DB/THIP  ((DESTINATION  (!  THE  A0021  DB/LOCATION  ((CITY 

CHICAGO)))))) 

This  in  turn  may  be  embedded  in  yet  another  constituent. 

Eventually,  the  top  level  of  the  grammar  will  create  such  a 
pseudo-interpretation  together  with  an  action  (usually  deriving  from  the 
main  verb)  in  the  ACTION  register.  At  the  final  POP  a function  called 
INTEHPBUILD  is  called  that  takes  this  form  and  action  and  produces  the 
corresponding  real  interpretation  in  the  command  language  described  above. 
It  turns  the  pseudo-interpretation  inside  out  so  that  every  embedded  (! 
...)  form  becomes  an  enclosing  (FOR:  ...)  form  in  which  the  original 

pseudo-interpretation  is  replaced  by  the  variable  it  contains.  For 
example,  "Show  me  Bill's  trip  to  Washington"  produces  the 
pseudo -interpretation 

( ! THE  A0058  DB/TRIP  ((TRAVELER  (!  THE  A0057  DB/PERSON 

((FIRSTNAME  BILL)))) 

(DESTINATION  (!  THE  A005o  LOCATION 

((AMBIG  WASHINGTON))))) 

which  (together  with  the  action  (OUTPUT:  A0058))  is  transformed  into 

(FOR:  THE  A0056  / (FINDQ:  LOCATION  (AMBIG  WASHINGTON)) 

: T : (FOR  THE  A0057  / (FINDQ:  PERSON  (FIRSTNAME  BILL)) 

: T : (FOR  THE  A0058  / (FINDQ  : DB/TRIP  (DESTINATION  A0056^ 
(TRAVELER  A0057)) 

: T ; (OUTPUT:  A0058)))) 

Building  semantic  interpretations  in  this  manner  is  almost,  but  not 
quite,  as  easy  as  it  seems,  In  some  cases  the  correct  quantifier  for  a 
constituent  cannot  be  determined  until  more  global  information  is 
available.  For  example,  "a  trip"  should  be  interpreted  generically  in  a 

declarative  sentence  ("A  trip  to  Chicago  costs  ..."),  as  "one  trip"  in  an 
imperative  sentence  with  a "constructive"  verb  ("Schedule  a trip...")  and 
as  "some  trip"  in  a yes/no  question  ("is  there  a trip...?").  This  is 
accomplished  by  having  a pseudo-quantifier,  SOME/ONE,  which  is  later 
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modified  to  eithe”  SOME  or  ONE.  (Generic  js  interpreted  as  ONE.) 
Sinilarly,  a pseudo-quantifier  HOWMANY  is  allowed  in  the  construction  of 
pseudo-interpretations  of  noun  phrases  which  is  later  transformed  into  the 
appropriate  structure  using  COUNT:  and  SE  F:  (see  above). 

here  are  a number  of  advantages  to  this  scheme.  In  fact,  if  the 
grams  ad  been  written  from  scratch  instead  of  being  extended  f-’om 
SMALLGhf...  which  built  both  syntactic  parse  trees  and  semantic 
interpretations,  it  would  have  been  much  smaller.  A new  and  specialized 
function  was  written  to  add  link-value  pairs  to  the  SEMLINKS  register, 
rather  than  always  u.r  ng  ADDL.  This  function,  INCORP,  incorporates  the 
new  pair  afte;  checking  to  see  that  information  was  not  being  duplicated  in 
an  invalid  way.  For  example,  the  sentence  "John's  trip  that  Bill  took" 
world  be  extremny  difficulty  to  eliminate  on  syntactic  grounds  alone,  but 
semantically  it  would  involve  two  different  (TRAVELER  x)  pairs  on  the 
SEMLINKS  list.  INCORP  checks  to  see  if  the  pair  it  is  adding  is  already 
represented  on  the  list  and  aborts  the  arc  if  it  is  incompatible.  In  the 
case  of  time  expressions,  however,  INCORP  must  perform  a more  complex 
check,  because  while  time  expressions  may  be  picked  up  at  several  places 
in  the  sentence,  only  the  most  specific  should  be  kept.  Thus,  "How  many 
trips  did  John  take?"  should  contain  an  indication  that  the  trips  were  in 
the  past,  but  "How  many  trips  did  John  take  in  July?"  should  contain  (TIME 
LAST  (MONTH  JULY)),  not  (TIME  (BEFORE  NOW)). 

Information  about  dialogue  dependent  matters  resides  in  the  TRIP  fork, 
and  there  are  several  places  where  the  grammar  calls  it  directly  in  order 
to  make  pragmatic  checks  on  a particular  parse  pata.  For  example,  the 
phrase  "that  meeting"  should  only  be  allowed  if  the  sentence  is  part  cf  a 
dialogue  in  which  a meeting  has  been  mentioned.  The  action  (VERIFY 
(ASKFORK  TRIP  (FOR:  SOME  X / (FINDQ : DB/MEETING)  ; T : (INFOCUS?  X))))  is 
used  on  an  arc  to  make  this  inquiry.  If  the  answer  is  false,  t^e  arc  is 
aborted.  (In  single  sentence  mode,  the  function  INFOCUS?  is  made  to  return 
T) . Note  that  this  check  can  be  made  as  soon  as  the  word  "that"  is 
considered;  it  is  not  necessary  to  process  as  far  as  "meeting",  since  it  i3 
being  picked  up  in  the  MEETING/  network. 
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Similarly,  cities  and  states,  cities  and  countries,  and 
f irstname/lastname  pairs  must  denote  entities  known  to  the  system.  This  is 
checked  by  the  function  GOODPLACE  (or  GQODNAME)  which  calls  the  THIP  fork 
to  validate  the  combinations  proposed  by  the  grammar. 

This  sort  of  checking  must  be  done  with  care.  For  example,  "two  trips 
to  Baghdad"  should  not  be  verified  pragmatically  until  a larger  context  is 
established,  since  the  phrase  by  itself  cannot  indicate  whether  they  are 
trips  that  the  system  should  know  about  (e.g.,  "Cancel  two  trips  to 
Baghdad."),  may  not  know  about  (e.g.,  "Are  there  two  trips  to  Baghdad  in 
the  budget?")  or  should  not  know  about  (e.g.,  "Add  two  trips  to  Baghdad."). 

6.  MIDGRAM 

To  experiment  with  grammars  of  different  sizes,  it  was  decided  to 
create  a grammar  called  MIDGRAM,  which  is  a subset  of  BIGGRAM.  To 
facilitate  this  operation,  a pseudo-action  NOTIN  was  created.  NOTIN 
apoears,  if  at  all,  as  the  third  element  of  an  arc  and  has  as  arguments  the 
grammars  that  do  not  include  this  arc.  (It  was  envisioned  that  someday  a 
TINY3RAM  would  exist  also.)  A procedure  was  written  to  extract  a grammar  X 
from  BIGGRAM  by  eliminating  from  BIGGRAM  all  arcs  which  contained  a (NOTIN 
...X...)  action.  If  all  arcs  were  removed  from  a state,  the  state  itself 
was  removed  from  the  resulting  grammar.  Appendix  1 gives  a listing  of 
MIDGRAM.  Appendix  2 shows  some  sample  parses  using  BIGGRAM.  The  following 
chart  compares  several  of  the  grammars  used  by  the  speech  system. 


BIGGRAM 

MIDGRAM 

SMALLGRAM 

SPEECHGRAMMAR 

^staves 

JUU8 

337 

399 

83 

#arcs 

881 

6U8 

773 

202 

^pbdrys 

77 

62 

0 

0 

factions 1 

2280 

16I49 

1I467 

386 

0CAT  arcs 

62 

55 

60 

Uo 

#WRD  arcs 
Swords  in 

1*39 

306 

366 

52 

TRAVELDICT  used2 
Swords  in 

1409 

1j22 

“ 

BIGDICT  used3 

1097 

- 

- 

- 

(blanks  indicate  not  applicable  or  measurements  not  made.) 

1 excluding  NOTIN  and  PBDRY  pseudoactions. 

2 using  the  expanded  version  of  TRAVELDICT  which  contains  667  words. 

3 using  the  expanded  version  of  BIGDICT  which  contains  1363  words. 
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7 • I unct ions  U sed  By  The  Grammar 

(AHTAGRE'E  det  wrd ) predicate  which  checks  phonological  compatiu  il  ity  of  an 
indefinite  article  with  the  word  following  it.  Will  screen  out  "a 
item",  etc. 

(ASKFORK  fork  form)  calls  the  indicated  LISP  fork  to  evaluate  form . 

(CATCHECK  word  category)  predicate  to  test  whether  word  has  the  part  of 
speech  category . 

( CATF  word  category  feature)  returns  the  value  of  feature  for  word  as  the 
part  of  speech  category. 

(CONJOINABLE  args)  takes  a list  of  possibly  conjoined  phrases  and  verifies 
tha^  no  two  are  the  same,  an  that  if  the  first  one  is  a pronoun,  it 
isn  t "I"  or  "me"  and  all  the  others  must  be  pronouns;  rejects 
combinations  like  "them  and  them". 

( GCODNAME  first  last)  calls  TRIP  fork  to  verify  that  first  and  last  names 
go  together. 

(GOODPLACE  city  state)  calls  TRIP  fork  to  verify  that  city  and  state 
(which  may  actually  be  a country)  go  together. 

(INCORP  reg  pair)  adds  pair  to  the  contents  of  the  named  register  reg  only 
if  the  links  don't  conflict  (see  discussion  above). 

(INTERBUILD  form)  when  form  is  (!  ...)  produces  an  expression  in  command 
language . 

(MERGE  reg  newlist)  calls  INCORP  to  add  each  element  (pair)  of  newlist  to 
the  register  reg . 

(NOTIN  grammar)  a pseudo-action  used  to  remove  arcs  to  get  a smaller 
grammar . 

(NUMBOF  form)  returns  the  number  represented  by  form. 

( PBDRY ) checks  for  a prosodic  boundary  preceding  the  current  word.  See 
Section  IV.  C. 

( PNCHECK  nounphrase  code)  checks  person-number  agreement  between  the 
nounphrase  and  the  code  that  was  obtained  from  the  verb;  e.g., 
"travel*  has  code  X 3SG  because  it  can  go  with  any  form  except  3rd 
person  singular. 

(RIGHTENDP)  returns  T if  the  end  of  the  utterance  has  been  reached; 
returns  NIL  otherwise. 

(SCOPE  [statel  ...  staten]  form)  requires  that  one  of  the  named  states 
has  been  encountered  on  the  path  to  the  left  before  evaluating  form . 
This  is  useful  when  parsing  right  to  left  when  form  uses  regi sters 
that  are  set  in  a state  in  the  list. 

(SEMPRO  pronoun)  returns  the  form  of  the  pronoun  to  be  used  in  building 
semantic  interpretations,  e.g.,  I,  ME  ->  SPEAKER;  HE  ->  (!  THAT 
PERSON  ((GENDER  MALE)));  etc. 

(SPREADR  [regi  ...  regn]  form)  takes  the  value  of  form  which  should  be  a 
list  of  elements,  and  sets  the  registers  to  those  elements  in  pairwise 
fashion.  It  calls  INCORP  ratner  than  SETR  if  the  register  name  is 
SEMLINKS. 

(TIMESENSE  tense  form)  tests  that  tense  does  not  conflict  with  a time  word 
(e.g.,  next,  since)  in  form. 

( TR1P-ADJ-CK)  checks  compatibility  of  the  modifiers  in  a trip  phrase  to 
screen  out  combinations  like  "previous  taken"  or  "European  domestic". 
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(VERIFY  form)  evaluates  form  and  causes  the  arc  to  be  aborted  if  the  value 
is  NIL. 

( WHOLEPROJ?  form)  returns  T if  form  is  a complete  project  name. 
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C.  FROSOBICS 

One  source  of  knowledge  available  to  speech  understanding  systems 
is  the  interpretation  of  the  supra  segmental  information  contained  in 
the  fundamental  frequency  contour  of  a sentence.  Lea's  earlier 
research  [Lea,  1972,  1973;  Lea  et  al.,  1973]  showed  that  a decrease  in 
fundamental  frequency  usually  occurs  at  the  end  of  each  major 
syntactic  constituent,  with  an  increase  usually  near  the  beginning  of 
the  next  one.  He  proposed  an  algorithm  for  "detecting"  syntactic 
boundaries  by  recognizing  this  fall-rise  pattern  in  Fq.  Phonetic 
effects  (especially  unvoiced  consonants)  can  also  jause  such  a 
fall-rise  pattern,  but  the  effect  is  generally  somewhat  smaller,  so 
they  can  be  screened  out  by  requiring  that  an  Fo  decrease  exceed  a 
"fall  threshold"  and  that  an  Fq  increase  exceed  a "rise  threshold"  in 
order  for  a boundary  to  be  recognized. 

Our  earlier  work  with  Lea's  algorithm  [Bates  and  Wolf,  1975] 
poi-ted  out  problems  with : 

1.  The  accuracy  of  the  Fq  data  produced  by  our  Signal  Processing 
component.  (This  has  since  been  largely  remedied  by  a newer  F 
extraction  algorithm,  described  in  Vol.  I,  Sec.  A.) 

2.  The  interpretation  of  the  boundary  locations  detected  by  the 
algorithm. 

2a.  Not  all  syntactic  boundaries  appeared  to  be  marked  by 
prosodic  boundaries. 

2b.  The  relationship  between  t iie  location  of  a prosodic  boundary 
found  by  the  algorithm  and  the  location  of  the  corresponding 
syntactic  constituent  was  far  from  precise.  Depending  on 
intonation  contour,  syllable  stress,  and  phonetic  effects, 
the  prosodic  boundary  could  occur  either  before  or  after  the 
syntactic  boundary. 

3.  Even  assuming  solutions  to  the  above  problems,  any  prosodic 
boundary  location  algorithm,  including  Lea's,  will  make  a certain 
number  of  errors.  The  questions  then  are  how  much  undertainty  is 
tolerable  for  prosodies  to  be  still  useful  and  how  can  prosodies 
best  be  used  in  the  process  of  speech  understanding. 

During  the  past  year,  we  have  been  working  with  the  Univac  group 
on  the  latter  two  problems.  This  interaction  led  to  some  specific 
proposals  on  iitting  a prosodies  knowledge  source  using  Lea's  boundary 
detection  algorithm  into  the  HWIM  system.  Such  a prosodies  component 
was  implemented  and  operationally  checked  out  toward  the  end  of  the 
project,  but  there  was  no  time  to  evaluate  its  contribution  to  the 
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speech  understanding  process.  This  section  describes  this  recent  work 
in  prosodies. 

The  initial  stage  of  the  Univac-B3N  effort  was  for  the  Univac 
group  to  study  the  operation  of  the  current  version  of  HWIM, 
particularly  with  respect  to  the  grammar  (SMALLGRAM  at  the  time)  and 
the  operation  of  the  Parser.  The  fact  that  at  the  time  all  aspects  of 
the  system  (especially  the  grammar)  were  steadily  being  modified  did 
not  make  their  job  any  easier.  However,  the  basic  way  the  system 
worked  remained  conerent  enough  for  their  conclusions  to  remain  valid. 
The  Univac  study  resulted  in  one  principal  and  several  secondary 
proposals  for  using  prosodic  knowledge  in  HWIM  [Lea,  1976].  Since 
there  was  little  time  remaining,  we  decided  to  adopt  the  principal 
proposal,  as  it  appeared  to  offer  practical  solutions  to  the  problems 
enumerated  above . 

The  Univac  group  noted  those  arcs  of  SMALLGRAM  that  were  expected 
to  be  accompanied  by  prosodic  boundaries.  Specifically,  a prosodic 
boundary  would  be  expected  to  occur  immediately  before  any  word  that 
caused  the  arc  to  be  take!!,  where  "immediately  before"  remained  to  be 
defined  precisely.  This  specification  narrowed  the  uncertainty  about 
where  the  prosodic  boundaries  were  predicted  to  occur  from  the  level 
of  a constituent  to  the  level  of  a word,  and  it  related  those 
predictions  directly  to  our  grammar. 

On  each  prosodically  marked  arc  of  the  grammar  there  is  a call  to 
the  function  PBDRY.  PBDRY  checks  whether  a prosodic  boundary  occurs 
to  the  left  of  the  word  that  caused  the  arc  to  be  taken.  If  a 
boundary  was  found  where  predicted,  a positive  score  increment  is 
added  to  a prosodic  score  that  the  Parser  maintains  for  each  parse 
path;  if  no  boundary  was  found,  a negative  adjustment  is  made  to  the 
prosodic  score.  This  prosodic  score  is  then  sent  to  the  Control 
component,  where  it  is  given  to  each  event  associated  with  a word 
match  that  results  from  the  accompanying  syntactic  prediction. 

More  oi ten  that  not,  a new  word  can  be  added  to  a theory  in 
several  different  ways.  That  is,  the  existing  set  of  parse  paths  can 
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be  extended  along  several  different  arcs  that  can  consume  the  new 
word.  As  many  of  these  arcs  may  be  marked  pro sodically , a theory  may 
be  assigned  more  than  one  prosodic  score.  The  Parser  only  returns  the 
best  of  these,  since  Control  does  not  distinguish  among  parse  paths. 

The  Univac  grammar  work  resulted  in  about  1/6  of  the  SMALLGRAM 
arcs  bearing  prosodic  boundary  predictions  (Since  the  grammar  was 
continually  changing,  we  could  not  expect  them  to  keep  current  on  it. 
When  we  changed  to  BIGGRAM,  we  "transliterated"  the  prosodic 
predictions  from  SMALLGRAM,  since  there  was  not  time  for  another 
interaction  with  the  Univac  group.)  Even  though  only  a small  portion 
of  the  grammar  is  affected,  it  turns  out  that  all  but  very  short 
sentences  generally  contain  at  least  one  or  two  prosodic  predictions. 

The  original  Univac  proposal  was  for  the  prosodic  checks  to  be 
performed  when  events  were  put  into  the  event  queue  [Lea,  1976]. 
Prosodic  score  adjustments  could  then  have  the  effect  of  reordering 
the  queue  so  as  to  decrease  the  possibility  of  processing  incorrect 
events.  While  applying  knowledge  early  is  certainly  desirable, 
examining  the  grammar  for  these  prosodic  checks  is  tantamount  to 
parsing.  Thus,  it  seemed  more  natural  and  economical  to  incorporate 
prosodic  evaluation  into  the  syntactic  processing  of  the  event,  as 
described  above  . 

We  now  describe  the  way  in  which  the  function  PBDRY  determines  if 
a prosodic  boundary  is  present  "immediately  before"  a word.  After 
RTIME  (Vol.  II,  Sec.  A)  has  computed  the  parameters  from  the  incoming 
speech  signal,  the  prosodies  component  is  called.  (This  is  presently 
a separate  program,  for  experimental  convenience,  but  conceptually  it 
could  be  a subroutine  within  RTIME.)  This  routine  calls  BBN 
implementations  of  Univac 's  boundary  detection  and  syllabification 
routines,  B0UND3  and  CHUNK,  which  use  Fq  and  energy  in  the  band 
60-3000  Hz  to  produce  prosodic  boundary  and  syllable  locations.  It 
then  "expands"  each  prosodic  boundary  from  specifying  a single  point 
to  specifying  an  interval  in  time  over  which  the  boundary  applies. 
Figure  1 illustrates  the  extent  to  which  the  ends  of  the  "boundary 
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interval"  are  moved  to  the  right  and  left  so  that  the  interval  covers 
the  two  syllables  nearest  the  boundary. 


Fig.  1.  Illustration  of  how  the  prosodic  boundary  location 
determined  by  B0UND3  (shown  by  the  vertical  line)  is 
"expanded"  to  cover  the  nearest  two  syllables. 

If  the  interval  so  defined  includes  a local  maximum  in  FO  (also 
determined  by  BOUN03),  the  interval  is  truncated  at  that  point,  since 
the  influence  of  a prosodic  boundary  caanot  extend  past  it.  Any 
boundaries  found  too  near  the  beginning  or  end  of  the  sentence  to 
cover  two  syllables  as  shown  in  Fig.  1 are  anomalous  and  therefore 
ignored.  The  times  of  the  start  and  end  of  each  boundary  interval  are 
then  written  on  a .StiND  file  (for  "spread  boundaries")  for  subsequent 
use  by  the  function  PBDRY  in  the  Parser. 


In  order  to  determine  whether  a word  is  "immediately  preceded"  by 
a prosodic  boundary,  PBDRY  makes  two  checks: 

a)  If  the  starting  time  of  the  word  falls  within  a boundary 
interval,  then  the  answer  is  YES. 

b)  If  the  word  does  not  carry  primary  stress  on  its  first  syllable 
(information  available  from  the  dictionary),  then  the  prosodic 
boundary  may  occur  within  the  word,  in  front  of  the  first 
stressed  syllable.  In  the  current  HWIM  system,  the  times  of  the 
syllables  are  not  available  to  the  Parser,  so  a much  less  precise 
check  is  made:  in  the  case  of  non-initial  stress,  if  there  is  any 
overlap  between  the  boundary  interval  and  the  word,  the  answer  is 
YES. 
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c)  Otherwise,  the  answer  is  NO. 

The  Univac  group  made  estimates  of  +50  and  -50  for  the  prosodic 

score  adjustments,  based  on  the  sizes  of  scores  they  observed  on  HWIM 

trace  listings.  The  likelihood  ratio  scoring  method  used  in  HWIM 

provides  a rational  basis  for  assigning  such  scores,  based  cn  prosodic 

component  performance,  as  follows: 

P(boundary  is  found  | arc  predicts  a boundary) 

— ----  (1) 

P(bouidary  is  found) 

P(no  boundary  is  found  ! arc  predicts  a boundary) 

(2) 

P(no  boundary  is  found) 

A first  approximation  may  be  made  from  performance  measures  cited  by 
Lea,  which  do  not  take  into  account  the  particular  boundary  location 
to  word  location  algorithms  described  above.  Lea  cites  previous 
studies  that  show  that  about  90J  of  the  expected  boundaries  are  found; 
using  this  figure  assumes  that  Univac 's  marking  of  our  grammar  works 
equally  well.  The  denominator  of  (1)  should  be  the  proportion  of 
boundaries  found  if  they  were  sought  on  every  word  hypothesized,  not 
just  on  those  with  marked  grammar  arcs.  Lea's  figure  of  1/6  of  the 
arcs  of  tne  grammar  marked  is  certainly  much  too  low  an  estimate  of 
this.  If  we  pick  twice  this  value  as  a guess,  then  (1)  and  (2)  become 
0.9/0-33  = 2.7  and  0.1/0.67  = 0.15.  To  simplify  arithmetic,  we  use 
100  times  the  log  of  the  likelihood  ratio,  so  (1)  and  (2)  translate  to 
+M  and  -82,  respectively  (not  far  from  Univac's  guesses).  Obviously, 
performance  estimates  derived  from  our  own  implementation  would 
produce  better  estimates,  which  would  probably  be  smaller,  due  to  the 
imprecision  of  the  boundary  spreading.  (The  B0UND3  routine  also 
produces  "confidence"  estimates  for  its  boundary  locations,  which 
could  be  factored  into  these  probability  estimates  as  well.) 

Unfortunately,  time  did  not  permit  us  to  carry  the  work  on  the 
prosodies  component  as  far  as  testing  its  effectiveness  in  aiding  the 
speech  understanding  process.  In  addition  to  that  obvious  task,  there 
is  more  work  on  prosodies  that  should  be  done.  Three  topics 
specifically  related  to  the  work  described  above  are: 
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1)  Refinement.  of  the  grammar-based  predictions  of  prosodic 

boundaries.  Lea  was  unfamiliar  with  prosodie  effects  in  some 
categories  of  our  grammar  ( sueh  as  numbers  and  dates),  so  some  of 
the  predictions  bear  less  eonfidenee  than  others.  Studies  of 
travel  budget  sentences  would  lead  to  better  predictions. 

2)  Refinement  of  the  relationship  of  the  locations  of  the  prosodic 
boundaries  and  the  grammatical  boundaries.  The  "boundary 
expansion"  and  PBDRY  eheeking  aigorithms  described  above  are 
obviously  rather  impreeise. 

3)  More  extensive  testing  of  the  prosodies  component  would  leao  to 
more  aeeurate  estimation  of  the  proper  values  for  scoring 
prosodic  events. 


Other  possible  extensions  of  prosodic  knowledge  to  the  speech 
understanding  proeess  may  be  found  in  [Lea,  1976]. 
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Appendix  2_  - Sample  Parses 

The  following  sentences  were  parsed  in  text  mode  using 
BIGDICT  and  bIGGRAM.  For  each  sentence  the  syntactic  parse  tree 
is  shown  followed  by  the  semantic  interpretation  which  is  also 
created  by  the  parser.  The  optimized  interpretation  is  obtained 
from  the  original  interpretation  by  a function  in  the  TRIP  fork 
(see  Vol.  5,  Sec.  E.3  Tor  details). 

Comments  which  follow  the  output  are  enclosed  in  brackets, 
{},  and  were  added  to  explain  or  call  attention  to  interesting 
aspects  of  the  parsing  process  and  its  result.  Additional 
examples  can  be  found  ir  Vol.  5,  Sec.  1.2. 
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DO  WE  HAVE  A SURPLUS 


Found  complete  parse: 

S 0 

SUBJ  NP  PRO  YOU 

FEATS  NU  SG 
AUX  TNS  PRESENT 
VOICE  ACTIVE 
VP  V HAVE 

OBJ  NP  N SURPLUS 

FEATS  NU  SG 


Interpretation: 


[FOR:  SOME  A0027  / (FINDQ:  DB/GROUP  (MEMBERS  SPEAKER)) 

: T ; (FOR:  SOME  A0026  / (FINDO:  DB/CONTRACT  (GROUP  A0027)) 

: T ; (FOR:  THE  A0028  / (FINDQ:  DB/BUDGET 

(BUDGET/OF  A0026)) 

• t • 

(TEST:  (GREATERP  (GET:  A0028 

MONEY/REMAINING ) 

0] 


Optimized  interpretation: 

[FOR:  SOME  A0027  / (FIND:  (INSTANCE/OF  (QUOTE  DB/GROUP)) 

(MEMBERS  (QUOTE  SPEAKER))) 

: T ; (FOR:  SOME  A0026  / (FIND:  (INSTANCE/OF  (QUOTE  DB/CONTRACT)) 

(GROUP  A0027  ) ) 

: T ; 

(FOR:  THE  A0028  / (FIND:  (INSTANCE/OF  (QUOTE  DB/BUDGET)) 

(BUDGET/OF  A0026)) 

: T ; 

(TEST:  (GREATERP  (GET:  A0028 

(QUOTE  MONEY/REMAINING)) 
(QUOTE  0] 


[This  is  an  example  of  the  type  of  expansion  (of  the  word  "we") 
which  is  usually  left  for  the  TRIP  fork,  but  which  can  be  done  by 
the  grammar.  The  interpretation  could  have  been  built: 

(FOR  THE  AGO 27  / (FINDQ:  DB/BUDGET  (BUDGET/FOR  SPEAKER)) 

: T ; (TEST:  (GREATERP  (GET:  A0027  MONEY ( REMAINING ) 

0))  ) 

if  the  TRIP  fork  could  infer  the  meaning  of  a person  at  the  end 
of  a BUDGET/FOR  link.  This  shorter  form  is  created  in  other 
places  in  the  grammar.) 


SHOW  ME  HER  TRIP-S 


Found  complete  parse: 

S IMF 

SU6J  NP  PRO  YOU 

FEATS  NU  SG 
AUX  TNS  PRESENT 
VOICE  ACTIVE 
VP  V SHOW 

OBJ  NP  DPT  FOSS  HER 
N TRIP 
FEATS  NU  PL 


Interpretat ion: 

(FOR:  THAT  A002  1 / ( FINDQ  : DB/PERSON  (GENDER  FEMALE)) 

: T ; (FOR:  EVERY  A0022  / (FINDQ:  DB/TRIP  (TRAVELER  A0021)) 
: T ; (OUTPUT:  A0022))) 


Optimized  interpretation: 

(FOR:  THAT  A0021  / (FIND:  (INSTANCE/OF  (QUOTE  DB/PERSON)) 

(GENDER  (QUOTE  FEMALE' ) ) 

: T ; (FOR:  EVERY  A0022  / (FIND:  (INSTANCE/OF  (QUOTE  DB/TRIP)) 

(TRAVELER  A0021  ) ) 

: T • (OUTPUT:  AC022))) 


{Third  person  pronouns  are  expanded  into  an  interpretation  with 
the  quantifier  THAT  since  it  is  necessary  for  the  TRIP  fork  to 
have  an  antecedent  if  the  interpretation  is  to  execute  correctly. 
Another  way  of  handling  this  would  be  to  call  the  TRIP  fork  as 
soon  as  the  pronoun  is  found  to  verify  the  existence  of  an 
antecedent.  This  latter  method  is  used  for  phrases  like  "that 
meeting" . } 
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* WHICH  1RIP-S  WERE  CANCEL-ED 


Found  complete  parse: 

S 0 

SUBJ  NP  PRO  SOMEONE 
FEATS  NU  SG 
AUX  TNS  PAST 

VOICE  PASSIVE 
VP  V CANCEL 

OBJ  NP  DET  WHICH 
N TRIP 
FEATS  NU  PL 


Int  erpret  at ion  : 

(FOR:  EVERY  A0013  / (FINDQ:  DB/TRIP  (MODALITY  CANCELED) 

(TIME  (BEFORE  NOW))) 

: T ; (OUTPUT:  A 00 1 3 ) ) 


Optimized  interpretation: 


(FOR:  EVERY  A0013  / (FIND:  (INSTANCE/OF  (QUOTE  DB/TRIP)) 

(MODALITY  (QUOTE  CANCELED)) 
(TIME  (QUOTE  PAST))) 

: T ; (OUTPUT:  A0013)) 


("Which"  causes  the  ouantifier  EVERY  to  be  used,  since  it  is 
assumed  that  at  least  one  trip  meets  the  criterion.  If  this  is 
not  the  case  in  the  data  base,  a message  to  that  effect  will  be 
printed.  If  the  quantifier  ALL  had  been  used  in  that 
circumstance,  nc  action  would  be  taken  if  no  canceled  trips  were 
found  . ) 
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I WILL  FLY  TO  SANgDIEGO 


Found  complete  parse: 

S DCL 

SUB J NP  PRO  I 

FEATS  NUMBER  SG 
AUX  TNS  FUTURE 
VOICE  ACTIVE 
VP  ’’  FLY 

PP  PREP  TO 

NP  NP R LOCATION  CITY  SANgDIEGO 


Interpretation  : 


(FOR:  THE  A0050  / 
: T ; (FOR: 


(FINDQ:  LOCATION  (CITY  SANgDIEGO ) ) 

1 A0051  / ( BUILDQ : DB/TRIP  (TIME  (AFTER  NOW)) 

(DESTINATION  A0050) 
(MODE/OF/TRANSPORT  AIR) 
(TRAVELER  SPEAKER)) 

: T ; T ) ) 


Optimized  interpretation: 

(FOR:  THE  A0050  / (FINDLOC  (INSTANCE/OF  (QUOTE  LOCATION)) 

(CITY  (QUOTE  SANgDIEGO))) 

: T ; (FOR:  1 A0051  / (BUILD:  DB/TRIP  (TIME  (QUOTE  FUTURE)) 

(DESTINATION  A0050) 
(MODE/OF/TRANSPORT  (QUOTE  AIR)) 
(TRAVELER  (QUOTE  SPEAKER))) 

: T ; T)  ) 


{The  mode  of  transport  is  recognized  in  adjectives  ("the  bus 
fare"),  prepositional  phrases  ("by  train"),  and  verbs  ("fly"). 
First  person  pronouns  always  become  SPEAKER  in  the  interpretation 
for  the  convenience  of  the  TRIP  fork.) 
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MMIM 


• SHOW  ME  BILL  -S  TRIP  TO  WASHINGTON 


Found  complete  parse: 

S IMP 

SUBJ  NP  PRO  YOU 

FEATS  NU  SG 
AUX  TNS  PRESENT 
VOICE  ACTIVE 
VP  V SHOW 

OBJ  NP  DET  ART  THE 
N TRIP 
PP  PREP  FOR 

NP  NP  R NAME  FIRST  BILL 
PP  PREP  TO 

NP  NP  R LOCATION  AMBIG  WASHINGTON 
FEATS  NU  SG 


Interpretation: 

[FOR:  THE  A0069  / (FINDQ:  PERSON  (FIRSTNAME  BILL)) 

: T ; (FOR:  THE  A0070  / (FINDQ:  LOCATION  (AMBIG  WASHINGTON)) 

: T ; (FOR:  THE  A0071  / (FINDQ:  DB/TRIP 

(DESTINATION  A0070) 
(TRAVELER  AOO69)) 

: T ; (OUTPUT:  A 007 1 ] 


Optimized  interpretation: 


[FOR:  THE  A0069  / (FIND:  (INSTANCE/OF  (QUOTE  PERSON)) 

(FIRSTNAME  (QUOTE  BILL)  ) ) 

: T ; (FOR:  THE  A0070  / (FINDLOC  (INSTANCE/OF  (QUOTE  LOCATION)) 

(AMBIG  (QUOTE  WASHINGTON))) 

: T ; (FOR:  THE  A 0 0 7 1 / (FIND:  (INSTANCE/OF 

(QUOTE  DB/TRIP)) 
(DESTINATION  A0070) 
(TRAVELER  A0069)) 

: T : (OUTPUT:  A0071] 


("Washington"  is  ambiguous  since  it  i3  both  a state  and  a city. 
This  fact  is  noted  in  both  the  parse  tree  and  the  interpretation. 
All  verbs  that  reauest  a printout  ("print",  "list",  "give  (me)." 
"show")  affect  the  parse  tree  but  not  the  interpretation.} 


1 

! 
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GIVE  ME  A LIST  OF  THE  UNTAKEN  TRIP-S 


Found  complete  parser 
S IMP 

SUB J NP  PRO  YOU 

FEATS  NU  SG 
AUX  TNS  PRESENT 
VOICE  ACTIVE 
VP  V GIVE 

OBJ  NP  DET  THE 

ADJ  UNTAKEN 
N TRIP 
FEATS  NU  PL 


Interpretation: 

(FOR:  ALL  A 0 0 8 1 / (FINDQ:  DB/TRIP  (TIME  (AFTER  NOW))) 
: T ; (OUTPUT:  A008  1 ) ) 


Optimized  interpretation: 


(FOR;  ALL  A0081  / (FIND:  (INSTANCE/OF  (QUOTE  DB/TRIP)) 

( TIME  (QUOTE  FUTURE) ) ) 

: T ; (OUTPUT:  A008l)) 


(The  adjective  "untaken"  marks  the  trip  as  future.  Other  trip 
adjectives  that  would  affect  the  tense  are  "recent",  "taken" 
"upcoming1',  and  "past".  The  grammar  checks  agreement'  so  that 
Phrases  like  "recent  upcoming  trip"  and  "She  took  an  upcoming 
trip"  are  no*  accepted.) 
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CANCRL  LYN  -S  TRIP  TO  THE  ASA  MEETING 


Found  complete  parse: 


S IMP 

SUBJ  NP  PRO  YOU 

FEATS  NU  SG 
AUX  TNS  PRESENT 
VOICE  ACTIVE 
VP  V CANCEL 

OBJ  NP  DET  ART  THE 
N TRIP 
PP  PREP  FOR 

NP  NP  R NAME  FIRST  LYN 
PP  PREP  TO 

NP  DET  ART  THE 
ADJ  NP  NP  R ASA 
N MEETING 
FEATS  NO  SG 
FEATS  NU  SG 


Interpret. at  ion  : 

[FOR:  THE  A0088  / (FINDQ:  PERSON  (FIRSTNAME  LYN)) 

: T ; (FOR:  THE  A0089  / (FINDO:  DB/CONFERENCE  (SPONSOR  ASA)) 

: T ; (FOR:  THE  A0090  / (FINDQ:  DB/TRIP 

(TO/ATTEND  A0089) 
(TRAVELER  AOO88)) 
: T ; (CANCEL;  A0090] 


Optimized  interpretation: 


[FOR:  THE  AOO88  / (FIND:  (INSTANCE/OF  (QUOTE  PERSON)) 

(FIRSTNAME  (QUOTE  LYN))) 

: T ; (FOR:  THE  A0089  / (FIND:  (INSTANCE/OF  (QUOTE  DB/CCNFERENCE ) ) 

(SPONSOR  (QUOTE  ASA))) 

: T ; (FOR:  THE  A0090  / (FIND:  (INSTANCE/OF 

(QUOTE  DB/TRIP)  ) 
(TO/ATTEND  A0089) 
(TRAVELER  AOO88)) 

: T ; (CANCEL;  A 0 0 9 0 ] 


{Here  is  one  of  the  few  occurrences  of  an  action  other  than  TEST: 
or  OUTPUT:.  The  possessive  is  always  interpreted  as  a traveler 
in  this  system.) 


HOW  MANY  TRIP-S  HAS  RICH  TAKEN 


Found  complete  parse: 

S 0 

SUBJ  NPR  NAME  FIRST  RICH 
AUX  INS  PAST 

VOICE  ACTIVE 
VP  V TAKE 

CBJ  NP  DET  QUANT  HOW§MANY 
N TRIP 
FEATS  NU  PL 


Interpretation: 

[FOR:  THE  A0093  / (FINDQ:  PERSON  (FIRSTNAME  RICH)) 

: T ; (FOR:  THE  A0092  / [SETOF:  (FINDQ:  DB/TRIP  (TRAVELER  A009: 

(TIME  (BEFORE  NOW] 

: T ; (OUTPUT:  (COUNT:  A0092] 


Optimized  interpretation: 

[FOR:  THE  A0093  / (FIND:  (INSTANCE/OF  (QUOTE  PERSON)) 

(FIRSTNAME  (OUOTE  RICH))) 

: T • (FOR:  THE  A0092  / [SETOF:  (FINDQ:  DB/TRIP  (TRAVELER  A009; 

(TIME  (BEFORE  NOW] 

: T ; (OUTPUT:  (COUNT:  A0092] 


["How  many"  questions  must  result  in  interpretations  which  create 
a set  and  then  count  the  elements.  See  text  for  further 
description  of  how  the  gramnar  handles  this.} 
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DOES  THE  SPEECH  BUDGET  HAVE  A SURPLUS 


Found  complete  parse: 


S 0 

SUBJ  NP  DET  ART  THE 
SPEECH 
N BUDGET 
FEATS  NU  SG 
AUX  TNS  P RESENT 
VOICE  ACTIVE 
VP  V HAVE 

OBJ  NP  N SURPLUS 

FEATS  NU  SG 


Interpretation: 

(FOR:  THE  A0104  / (FINDQ:  DB/BUDGET  (PROJECT  (SPEECH))) 

: T ; (TEST:  (GREATERP  (GET:  A0104  MON E Y / REM A I N I NG  ) 

0))) 


« 


Optimized  interpretation: 

[FOR:  THE  A0104  / [FIND:  (INSTANCE/OF  'QUOTE  DB/BUDGET)) 

(PROJECT  (QUOTE  (SPEECH] 

: T ; (TEST:  (GREATERP  (GET:  A 0 1 0 i4  (QUOTE  MON  EY  / REM  A I N I NG  ) ) 

(QUOTE  0] 


{The  concept,  of  a surplus  is  not.  direct. ly  available  in  the  data 
base,  so  it  must  be  calculated  explicitly.} 
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MUMbH.:i|  pMiMM 


IS  JOHN  MAKHOIJL  SCHEDULE-ED  TO  GO  TO  PARIS 


Found  complete  parse: 


S 0 

SUBJ  NP  PRO  SOMEONE 
FEATS  NU  SG 
AUX  TNS  PRESENT 
VOICE  PASSIVE 
VP  V SCHEDULE 
OL  I S NIL 

SUBJ  NPR  NAME  FIRST  JOHN 

LAST  MAKHOUL 
AUX  TNS  PRESENT 
VOICE  ACTIVE 
VP  V GO 

PP  PREP  TO 

NP  NPR  LOCATION  CITY  PARIS 


Interpr  tation: 

[TEST:  (FOR:  THE  AOIQO  / (FINDO:  PERSON  (LASTNAME  MAKHOUL) 

(FIRSTNAME  JOHN)) 

: T ; (FOR:  THE  A 0 11 0 / (FINDQ:  LOCATION  (CITY  PARIS)) 

: T ; (FOR:  ALL  A 0 1 1 1 / 

(FINDO:  DB/TRIP  (DESTINATION  A 0 1 10) 
(TRAVELER  A 0 1 0 9 ) ) 

: T ; T] 


Optimized  interpretation: 

f TEST : (FOR:  THE  A0109  / (FIND:  (INSTANCE/OF  (QUOTE  PERSON)) 

(LASTNAME  (QUOTE  MAKHOUL)) 

(FIRSTNAME  (QUOTE  JOHN)  ) ) 

: T ; (FOR:  THE  A 0 1 1 0 / (FINDLOC  (INSTANCE/OF  (QUOTE 

LOCATION) ) 

( CITY  (QUOTE  PARIS)  ) ) 

: T ; (FOR:  ALL  A 0 1 1 1 / 

(FIND:  (INSTANCE/OF  (QUOTE  DB/TRIP)) 

(DESTINATION  A 0 1 10) 

(TRAVELER  AO  1 09  ) ) 

: T ; T] 


{Yes/no  questions  may  be  answered  by  TEST:  either  as  the  primary 
form  or  as  an  action.  The  use  of  the  passive  voice  complicates 
the  parse  tree  but  not  the  interpretation.} 
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SCHEDULE  A TRIP  FOR  JACK  KLOVSTAD  TO  SDC 


i 


Found  complete  parse: 

S IMP 

SUBJ  NP  PRO  YGU 

FEATS  NU  SG 
AUX  TNS  PRESENT 
VOICE  ACTIVE 
VP  V SCHEDULE 

OBJ  NP  DF,T  ART  A 
N TRIP 
PP  PREP  FOR 

NP  R NAME  FIRST  JACK 

LAST  KLOVSTAD 

PP  PREP  TO 

NP  NP  R LOCATION  SDC 
FEATS  NU  SG 


Interpretation: 

(FOR:  THE  A 0 1 1 M / (FINDO:  PERSON  (LASTNAME  KLOVSTAD) 

(FIRSTNAME  JACK)) 

: T ; (FOR:  1 A 0 1 16  / (BUILDQ:  DB/TRIP  (DESTINATION  SDC) 

( TRAVELER  AOI  14  ) ) 

: T ; T ) ) 


Optimized  interpretation: 

(FOR:  THE  A 0 1 1 i)  / (FIND:  (INSTANCE/OF  (QUOTE  PERSON)) 

(LASTNAME  (QUOTE  KLOVSTAD)) 

(FIRSTNAME  (QUOTE  JACK))) 

: T ; (FOR:  1 A 0 1 1 6 / (BUILD:  DB/TRIP  (DESTINATION  (QUOTE  SDC)) 

( TRAVELER  AOI  14)) 

: T ; T) ) 

j 

1 

i 

1 

(This  sentence  is  ambiguous.  See  other  parsing  on  next  page.} 


S IMP 

SUBJ  NP  PRO  YOU 

FEATS  Nil  SG 
AUX  TNS  PRESENT 
VOICE  ACTIVE 
VP  V SCHEDULE 

OBJ  NP  DET  ART  A 
N TRIP 
PP  PREP  FOR 

NP  R NAME  FIRST  JACK 

LAST  KLOVSTAD 

PP  PREP  TO 
NP  N SDC 

FEATS  NU  SG 
FEATS  NU  SG 


Interpretation: 

(FOR:  THE  A 0 1 1 U / (FINDQ:  PERSON  (LASTNAME  KLOVSTAD) 

(FIRSTNAME  JACK)) 

: T ; (FOR:  THE  A 0 1 1 5 / (FINDQ:  DB/CONFERENCE  (LOCATION  SDC)) 
: T ; (FOR:  1 A 0 1 1 7 / ( BUILDQ  : DB/TRIP 

(TO/ATTEND  AOI 15) 

( TRAVELER  AOI 1 4 ) ) 

: T ; T) ) ) 


Optimized  interpretation: 

(FOR:  THE  A0114  / (FIND:  (INSTANCE/OF  (QUOTE  PERSON)) 

(LASTNAME  (QUOTE  KLOVSTAD)) 

( FIRSTNAME  (QUOTE  JACK  ) ) ) 

: T ; (FOR:  THE  A 0 1 1 5 / (FIND:  (INSTANCE/OF  (QUOTE  DB/CONFERENCE) 

( LOCATION  (QUOTE  SDC  ) ) ) 

: T ; (FOR:  1 A 0 1 1 7 / (BUILD:  DB/TRIP  (TO/ATTEND  A011 

(TRAVELER  A 0 1 1 4 ) ) 

: T ; T ) ) ) 


{This  is  ambiguous  because  SDC  can  be  either  a destination  or  the 
sponsor  of  a meeting.) 
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* LYN  -S  1 H 1 P TO  FRANCE  COST-FAST  SIX  HUNDRED  DOLLAR-S  • 


Found  complete  parse* 

S D'.i 

SUBJ  NP  DET  ART  THE 
N TRIP 
PP  PREP  FOR 

NP  NP  R NAME  FIRST  LYN 
PP  PREP  TO 

NP  NP  R LOCATION  COUNTRY  FRANCE 
FEATS  NU  SG 
AUX  TNS  PAST 

VOICE  ACTIVE 
VP  V COST 

OBJ  NP  $ 600 


Int  erpret  at ion 


[FOR:  THE  AO  Hi 8 / 
: T ; (FOR: 


(FINDQ:  PERSON  (FIRSTNAME  LYN)) 

THE  AO 1 49  / (FINDQ:  LOCATION  (COUNTRY  FRANCE)) 

: T ; (FOR:  THE  A0150  / (FINDQ:  DB/TRIP 

(DESTINATION  A0149) 
(TIME  (BEFORE  NOW)) 
(TRAVELER  AO  1 48  ) ) 

: T ; ( PROGN  (ADD:  A0150  VALUE  600) 

(ADD:  A 0 1 5 0 UNITS  DOLLARS] 


Optimized  interprets  t.  un: 

[FOR:  THE  A 0 1 4 8 / (FIND:  (INSTANCE/OF  (QUOTE  PERSON)) 

(FIRSTNAME  (QUOTE  LYN))) 

: T ; (FOR:  THE  A 0 1 4 9 ' (FINDLOC  (INSTANCE/OF  (QUOTE  LOCATION)) 

(COUNTRY  (QUOTE  FRANCE))) 

: T ; (FOR:  THE  A 0 1 5 0 / (FIND:  (INSTANCE/OF 

(QUOTE  DB/TRIP ) ) 
(DESTINATION  A 0 1 4 9 ) 
(TIME  (QUOTE  PAST)) 
(TRAVELER  A 0 1 4 8 ) ) 

: T ; (PROGN  (ADD:  A0150  (QUOTE  VALUE) 

(QUOTE  600)) 

(ADD:  A 0 1 5 0 (QUOTE  UNITS) 
(QUOTE  DOLLARS] 


{It  is  assumed  that  the  statement  is  giving  the  system  new 
information  which  must  be  added  to  the  data  base.  Since  only  one 
form  is  allowed  as  an  action,  the  LISP  form  PROGN  which  evaluates 
a sequence  of  forms  must  be  used.} 
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f 


HOW  MUCH  IS  IN  THE  SPEECH  UNDERSTANDING  BUDGET 


Found  complete  parse: 


S Q 

SUBJ  NP  DET  QUANT  HCWMUCH 
N ? 

FEATS  NU  MASS 
AUX  TNS  PRESENT 
VOICE  ACTIVE 
VP  V BE 

FP  PREP  IN 

NP  DET  ART  THE 

SPEECH  UNPERSTANDING 
N BUDGET 
FEATS  NU  SG 


Interpretation: 

(FOR:  THE  A0146  / ( Flt'DQ  : DB/BUDGET  (PROJECT  (SPEECH  UNDERSTANDING))) 
: T ; (OUTPUT:  (GET:  A0146  MONEY/REMAINING))) 


Optimized  interpretation: 

[FOR:  THE  A01H6  / [FIND:  (INSTANCE/OF  (QUOTE  DB/BUDGET)) 

(PROJECT  (QUOTE  (SPEECH  UNDERSTANDING  ] 
: T ; (OUTPUT:  (GET:  A0 1 4 6 (QUOTE  MONEY/REMAINING] 


(Syntactically,  the  elliptical  head  of  "how  much"  could  be  any 
mass  ncun,  but  in  our  environment  the  only  semantic  head  is 
"money" . } 


PLEASE  GIVE  ME  THE  ACTUAL  COST  OF  TRIP  NUMBER  SIX  EIGHT  SEVEN  THRFE 


Found  complete  parse: 

S IMP 

SUBJ  NP  PRO  YOU 

FEATS  NU  SG 
AUX  TNS  PRESENT 
VOICE  ACTIVE 
VP  V GIVE 

OBJ  NP  DET  ART  THE 
ADJ  ACTUAL 
N COST 
PP  PREP  OF 
6873 

FEATS  NU  SG 


Interpretation: 

(FOR:  THE  A0 1 4 1 / (FINDQ:  DB/TRI?  (TRIP#  6873  )) 

: T ; IFOR:  THE  A 0 1 4 2 / (FINDQ:  DB/COST  (COST/OF  A01H1)) 
: T ; (OUTPUT:  A 0 1 4 2 ) ) ) 


Optimized  interpretation: 

(FOR:  THE  A0 1 4 1 / (FIND:  (INSTANCE/OF  (QUOTE  DB/TRIP)) 

(TRIP#  6873)) 

: T ; (FOR:  THE  A 0 1 4 2 / (FIND:  ( INST ANCF/OF  (QUOTE  DB/COST)) 

(COST/OF  A 0 1 4 1 ) ) 

: T ; (OUTPUT:  A 0 1 4 2 ) ) ) 


{Since  there  is  currently  no  difference  in  the  data  base  between 
"estimated  cost"  and  "actal  cost.",  the  adjective  has  no  effect  on 
the  interpretation  which  is  constructed.  The  number  given  as  a 
trip  number  must  have  the  correct  number  of  digits.} 
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ft 


PLEASE  SHOW  ME  JOHN  MAKHOUL  -S  THREE  TRIP-S  TO  PITTSBURGH 


Found  complete  parse: 

S IMP 

SUBJ  NP  PRO  YOU 

FEATS  NU  SG 
AUX  TNS  PRESENT 
VOICE  ACTIVE 
VP  V SHOW 

OBJ  NP  DET  THE 
ADJ  3 
N TRIP 
PP  PREP  FOR 

NP  NP  R NAME  FIRST  JOHN 

LAST  MAKHOUL 

PP  PREP  TO 

NP  NP  R LOCATION  CITY  PITTSBURGH 
FEATS  NU  PL 


Interpretation: 

[FOR:  THE  A 0 1 3 1 / (FINDQ:  PERSON  (LASTNAME  MAKHOUL) 

(FIRSTNAME  JOHN)) 

: T ; (FOR:  THE  A 0 1 32  / (FINDQ:  LOCATION  (CITY  PITTSBURGH)) 

: T ; (FOR:  (THE  3) 

AO  1 33  / (FINDQ:  DB/TRIP  (DESTINATION  A 0 1 3 2 ) 

(TRAVELER  A013U) 

: T ; (OUTPUT:  A 0 1 3 3 ] 


Optimized  interpretation: 

[FOR:  THE  A0 1 3 1 / (FIND:  (INSTANCE/OF  (QUOTE  PERSON)) 

(LASTNAME  (QUOTE  MAKHOUL)) 

(FIRSTNAME  (QUOTE  JOHN))) 

: T ; (FOR:  THE  A 0 1 3 2 / (FINDLOC  (INSTANCE/OF  (QUOTE  LOCATION)) 

(CITY  (QUOTE  PITTSBURGH))) 

: T ; (FOR:  (THE  3) 

A 0 1 3 3 / (FIND:  (INSTANCE/OF  (QUOTE  DB/TRIP ) ) 
(DESTINATION  A0132) 

( TRAVELER  A 0 1 3 1 ) ) 

: T ; (OUTPUT:  A0 133] 


(This  is  straightforward  except  for  the  auantifier  "three". 
Since  the  sentence  implies  that  there  are  exactly  3 such  trips, 
the  semantic  auantifier  (THE  3)  is  Renerated.} 
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WHO  IS  GO-ING  TO  IFIP 


Found  complete  parse: 

S 0 

SUBJ  NP  PRO  WHO 

FEATS  NU  SG/PL 
AUX  TNS  PRESENT  PROGRESSIVE 
VOICE  ACTIVE 
VP  V GO 

PP  PREP  TO 
NP  N IFIP 

FEATS  NU  SG 


Interpret  at  ion: 


[FOR:  THE  AOOU  / (FINDO:  DB/CONFERENCE  (SPONSOR  IFIP)) 

: T ; (FOR:  ALL  A0015  / (FINDO:  DB/TRIP  (TO/ATTEND  A 00 1 4 ) ) 
: T ; (OUTPUT:  (GET:  A0015  TRAVELER] 


Optimized  interpretation: 

[FOR:  THE  AOOlU  / (FIND:  (INSTANCE/OF  (QUOTE  DB/COt'FERENCE ) ) 

(SPONSOR  (QUOTE  IFIP))) 

: T ; (FOR:  ALL  A 00 1 5 / (FIND:  (INSTANCE/OF  (OUOTE  DB/TRIP)) 

(TO/ATTEND  A 00  1 4 ) ) 

: T ; (OUTPUT:  (GET:  A0015  (QUOTE  TRAVELER] 


(The  first  word  causes  the  action  to  be  set;  the  rest  of  the 
pause  builds  the  information  necessary  to  describe  a trip.) 
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Appendix  3 - Annotated  Syntax  Trace 


Appendix  3 contains  an  actuaJ  listing  of  a syntax  trace  file  with 
explanatory  notes  added.  The  sentence  was  entered  in  text  mode , one  event 
at  a time  to  simulate  bi-directional  middle-out  parsing.  While  the 
sentence  was  run  using  BIGGRAM,  and  BIGDICT,  all  the  relevant  arcs  used 
in  its  parsing  are  also  in  MIDGRAM.  Thus,  the  interested  reader  may  follow 
the  trace  in  more  detail  by  referring  to  Appendix  1.  Page  references 
within  this  appendix  are  to  the  page  num>er  at  the  top  of  the  trace  listing. 


-107- 


<C.BROWN>  A P TEND  I X -TRACE.  SYNTAX  ; 4 


Hon  20-Dec-76  6:  12PH  PAGE  1 


*********************************************.****** 
* * 


* BILL  -S  TRIP  TO  WASHINGTON  COSTS  NI  MTT  DCLLAR-S  * 


This  is  a syntax  trace  for 
the  above  sentence,  entered  In 
text  mode  but  simulating 
middle-out  bi-directional 
parsing. 


.Word  match  number 

Word  boundaries.  These  are 
arbitrary  for  text  input 

rWord  match  lexical  score 
(.always  0 for  text  input) 


TBIP 


r — Seed  event 

Weqinninq  new  island  with  word  natch 

/ 

Doinq  arc:  (WRD  (TRIP)  — (from  S/THE 

addinq  to  left  of  PREHODS  to  get  ((ADJ  trtsm 


0) 


I''  1 

■Brief  arc  description  (see  trace  listing 

page  1:1  for  a more  detailed 

311)  (to  S/THE-TRIP  3 1 3 ) ) explanation) 


Creatinq  SCONPIG  group: 

1:  S/THE  (2) -S/THE- TRIP  (3) 

Reqs  = (PREHODS  ((ADJ  TRIP))) 
States  = S/THE 
Scopedacts  = NIL 
Parent  SC  = NIL 


Segment  configuration. 

(see  trace  listing  page  1:2  for  detailed 

description) 


Doinq  arc:  (WPD  (TRIP)  — (fro*  T- DET- SG-D  EF/  363)  (to  TRIP/N0HBER1  422 

)) 

Creatinq  SCONPIG  qroup: 

2:  T-DET-SG-DEF/  (2  ) -TP I P/NU  fl  BER 1 ( 3) 

Reqs  = NIL 

States  = T-DET-SG-DEF/ 

Scopedacts  = NIL 
Parent  SC  = NIL 

Doinq  arc:  (WRD  (TRIP)  --  (fro*  T-SG/DET  377)  (to  TRIP/HEAD  420)) 

setting  SEHHEAD  to  DB/TRIP 
setting  HEAD  to  (N  TRIP) 
settinq  NU  to  SG 

Savinq  scoped  action  1 for  Sconfig  3 

with  scope  T-DET-SG-DEP/  T-DET-SG-INDEF/  T-DFTQ-SG/  T-ENTER/ 
T-SG-NON-DEH/  T-SG-POSS?/ 

Savinq  scoped  action  2 for  Sconfig  3 

with  scope  T-DET-SG-DEP/  T-DET-SG-INDEF/  T-DETQ-SG/  T-ENTER/ 
T-SG-NON-DEH/ 
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<G  DF  OW  N>  APPENDIX -TRACE.  SYNTAX  ;4  Mon  20-Dec-76  6 : 1 2P  fl  PAGE  1:1 


Creating  SCONPIG  qroup: 

3:  T-SG/DET(  2) -TRIP/HEAD  (3) 

Reqs  = (NU  SG  HEAD  (N  TRIP)  SEMHEAD  DB/TRIP) 

States  = T-SG/DET 

Scopedacts  = (12)  .Left  state  name  and  number 

Parent  sc  = Hll  /kxcL  tyfe.  , / . , „ , . 

yS  .Arc  label  / Right  state  name  and  number 

Domq  arc:  (HRD  (TRIP)  — (from  TO/TAKB-A  405)  (to  TO/GO  393)) 

£S9ttinq  OBJ  to  (NP  (DET  (ART  A))  (N  TRIP)  (PEATS  (Nil  SG))) 

„ ..  ' Arc  action  setting  a register 

Creatinq  SCONPIG  qroup:  6 6 

4:TO/TAK  E-A  (2) -TO /GO  (3) 

Reqs  = (OBJ  (NP  (DET  (ART  A))  (N  TRIP)  (PEATS  (FO  SG)  ) ) ) 

States  = TO/TAKE- A 
Scopedacts  = NIL 
Parent  SC  = NIL 


Doinq  left  proposals  for  Iconfiq  1 

found  39  arcs  reachinq  Iconfiq. 
Doinq  riqht  proposals  for  Iconfiq  1 
found  51  arcs  reachinq  Iconfiq. 


These  refer  to  the  number  of 
different  terminal-consuming 
arcs  from  which  the  predictions 
for  the  island  are  drawn. 


Created  new  island: 

Island  confiq  1=2  TRIP  3 
Left  spansflaq  = NIL 
Riqht  spansflaq  - NIL 
Parent  IC  = NIL 


Isi  ,nd  configuration. 

Spanning  flags  indicate  whether  the  end 
of  the  utterance  has  been  reached. 
Referred  to  as  "["  or  "]"  in  the  boxes. 


Adding  possessive  morpheme 
-S  to  left  of  island 
containing  the  word  TRIP. 


************** 


* -S  <-  TRIP  * 


Process  inq  event  for  Iconfiq  1 with  word  natch  (2  -S  1 2 0) 

Spanninq  flaqs  Left:  NIL  Riqht:  NIL 

- These  are  the  SCONFIGs 

Extendinq  To  Do  Queue  froi  Sc’s:  (1  2 3 4)  from  the  old  island. 

« . 

Doinq  consuming  arcs  for  the’event. 

The  fact  that  no  arcs  are  processed  between  extending  the  TODOQ  and  doing  the 
consuming  arcs,  indicates  that  there  are  no  intervening  levels  or  arcs  between 
the  old  island  and  the  new  arc. 
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<GBROWN>  APPENDIX -TRACE.  SYNTAX;  4 


Non  20-Dec-76  6: 12PN  PAGE  1:2 


Doing  arc:  (WRD  (-S)  — (froii  T-SG-POSS?/  376)  (to  T-SG/DET  377)) 


’} 


Savinq  scoped  action  3 for  Sconfig  6 
with  scope  T-SG-NON-DEN/ 

Poinq  scoped  action  1 for  SC  5 

in  arc  (WRD  (TRIP)  — (fro*  T-SG/DET  377) 


Saving  action  to  wait  for  proper 

left  context,  see  page  1:5  for  execution. 

(to  TRIP/HEAD  420)) 


Creatinq  SCONFIG  qroup: 

5:  T-SG-  POSS  ?/  (1 ) -T  RIP/HEAD(  3) 

Feqs  = (NH  SG  HEAD  (N  TRIP)  SENHEAD  DB/TBIP)^ 
States  = T-SG-POSS?/  T-SG/DET 
Scopedacts  = (3  2) 

Parent  SC  = 3 


)oi  nq 
)oi  nq 


left  proposals  for  Iconfiq  2 
found  3 arcs  reaching  Iconfig. 

riqht  proposals  for  Iconfig  2 
found  42  arcs  reaching  Iconfig. 


<SCONFIG  number> : 

<Left  state>(<Left  boundary>)- 
<Right  state> (<Right  boundary>) 

'''Current  register  settings 

eft  states  of  arcs  already 
executed 

Outstanding  scoped  actions 


Created  nee  island: 


Island  confiq  2 = 1 -S  2 TRIP  3 
Left  spansflaq  = NIL 
Riqht  spansflaq  - NIL 
Parent  IC  = 1 


******************* 
* * 

* BILL  <-  -S  TRIP  * 

* <*.' 

******************* 


Processing  event  for  Iconfiq  2 with  word  natch  |1  BILL  0 10) 
Spanning  flags  Left:  NIL  Right:  NIL 


Extending  To  Do  Queue  fron  Sc's:  (5) 

Extending  UCtQ:  (5) 

Creatinq  new  SCONFIG  group  at  different  level: 
6:  N AH  F/POP  ( 1)  -NAHE/POP  ( 1 ) 

Reqs  = NIL 
States  = NIL 
Scopedacts  * NIL 
Parent  SC  = NTL 
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<GBROVN>AP  ~'i  DIX-TR ACE. SYNTAX  ;4 


on  20-Dec-76  6:12PH  PAGE  1:1 


Doinq  arc:  (JUfSP  NAME/POP  --  (from  NAHE/PIRST  124)  <to  NAHE/POP  125)) 

Savinq  scoped  action  4 for  Sconfiq  7 
with  scope  NAME/ 

Creatinq  SCONPIG  group: 

7:  N A M E/ PT  RST  (1)  - N A fl  E/POP  ( 1 ) 

Reqs  - NIL 
States  = NAM  E/PIRST 
Scopedacts  = (4) 

Parent  SC  = 6 

Doinq  consuming  arcs  for  the  event. 


Doinq  arc:  (CAT  FT RST  NAN E — (from  NAME/  123)  (to  NAHE/PIRST  124)) 

setting  NAME  to  T 
settinq  PIRST  to  BILL 
Doinq  scoped  action  4 for  SC  fl 

in  arc  (JUMP  NAME/POP  — (from  NAHE/PIRST  124)  (to  NAHE/POP  125)) 

Creatinq  SCONPIG  group: 

8:  N AM  E/  (0)  ~ NAHE/POP  (1) 

Reqs  =1  (PIRST  RILL  NAME  T) 

States  = NAME/  NAHE/PIRST 
Scopedacts  = NIL 
Parent  SC  = 7 

Doinq  left  proposals  for  Iconfig  1 

found  27  arcs  reaching  Iconfig. 

Doinq  riqht  proposals  for  Iconfig  1 
found  42  arcs  reachinq  Iconfiq. 

Created  new  island: 

Island  confiq  3 = 0 BILL  1 -S  2 TRIP  3 
Left  spansflaq  = NIL 
Riqht  spansflaq  - NIL 
Parent  IC  = 2 


********* ********* 

* * 

* T BILL  -S  TRIP  * 

* * 
****(************* 

Doing  end  event  for  the  left  end  of  the  island,  i.e.,  connecting 
the  island  to  the  start  state,  S/. 
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Tryinq  to  connect  to  left  end  of  sentence. 

Fxtendinq  To  Do  Queue  from  Sc’s:  (8) 

Splittinq  the  UB!  Queue:  (8) 

Comnletinq  constituents  for  the  UB1C0MPLETEQ:  ( f) 

Doinq  arc:  (Pop  --  (from  NAHE/POP  125)) 

settinq  TNTERP  to  (!  THE  A0013  PERSON  ((FIRSTNAME  BILL))) 
sett  inq  ILIPTLIST#  to  (INTEPP) 
liftinq  reqister  INTERP. 

Savinq  scoped  action  5 for  Sconfiq  9 
with  scope  POP 

Creatinq  SCONFIG  qroup: 

9: NAME/  (0 ) - NAME/POP  (1) 

Peqs  = { # LIFTL IST#  (INTERP)  INTERP  (!  THE  A f?  0 1 3 PFLoON  ((FIRSTNAME 
BILL)))  FIRST  BILL  NAHET) 

States  = NAME/POP  NAME/  NAME/FIRST 
Scopedacts  = (5) 

Parent  sc  = 8 

Creatinq  direct  connection  to  new  constituent  f tea: 

5:T-SG-P0SS?/i  1}  -TR TP/HEA D ( 1) 

Reqs  = (Nil  SG  HEAD  (N  TRIP)  SEMHEAD  DB/TR1P) 

States  = T-SG-POSS?/  T-SG/DET 
Scopelacts  = (3  2) 

Parent  SC  = 3 


Creatinq  constituent  from  Sconfiq  9: 


(!  THE  A00H  PERSON 


((FIRSTNAME  BILL))) 


Semantic  representation 
new  constituent 


of 


NPR  NAM  E FIRST  BILL* 


•Syntactic  tree  representation  of 
new  constituent 
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noinq  arc:  (PUSH  NAHE/  — (from  T-SG-NON-DEH/  375)  (to  T-SG-POSS?/  376 
)) 


settinq  SEHQUA  NT  to  THE 

addinq  to  left  of  S ERL  INKS  to  qet  ((TRAVELER  (!  THE  A0013  PERSON 
((  FIPSTNAHE  BILL)  ) ) ) ) 

settinq  DET  to  (DPT  THE) 
settinq  PERSONflOD  to  T 

settinq  PERSON  to  (NPR  (NAME  (FIRST  BILL))) 

Doinq  scoped  action  3 for  SC  10 

in  arc  (WRD  (-S)  --  (froa  T-SG-POSS?/  376) 

settinq  DET  to  (DET  (ART  THE)) 


Execution  of  scoped  action 
saved  on  page  1:2  above.  Note 
that  scope  is  same  as  left 
state  of  the  arc  being  executed 

(to  T-SG/DET  377) 


addinq  to  left  of  NHODS  to  get  ((PP  (PREP  FOR)  (NP  (NPR  ( N AH  E ) 
(FIRST  BILL)))))) 

Doinq  scoped  action  2 for  SC  10 

in  arc  (HRD  (TRIP)  --  (froa  T-SG/DET  377)  (to  TRIP/HEAD  420)) 


Creatinq  SCONPIG  qrouD: 

10:T-SG-NON-DEH/(-)-TRIP/HEAD(3) 

Reqs  = (NHODS  ( (PP  (PREP  POR)  (NP  (NPR  (DANE  (FUST  BILL))))))  DET 
(DET  (ART  THE))  PERSON  (NPR  (NAHE  (PIRST  BILL)))  PERSONHOD  T DET  (DET 
THE)  SEHLINKS  ((TRAVELER  (!  THE  A0013  PERSON  ( ( FIRSTN  AHE  BILL))))) 
SEflQUANT  the  NU  SG  HEAD  (N  TRIP)  SEHHEAD  DB/TRIF) 

States  = T-SG-NON-DEH/  T-SG-POSS?/  T-SG/DET 
Scopedacts  = NIL 
Parent  SC  = 5 


Eatendinq  To  Do  Queue  from  Sc's:  (10) 

Doinq  arc:  (JUHP  T- SG-NON-DEM/  — (froa  T-DET- SG-DEF/  363)  (to 
T-SG-NON-DEH/  375)  ) 

Creatinq  SCONPIG  qroup: 

11:  T-DET-  SG-DEP/(0)  - TRIP/HEAD  (3) 

Reqs  = (NHODS  ( (PP  (PREP  FOP)  (NP  (NPR  (NAHE  (FIRST  BILL))))))  DET 
(DET  (ART  THE))  PERSON  (NPR  (NAHE  (PIRST  BILL)))  PERSONHOD  T DET  (DET 
THE)  SEHLINKS  ((TRAVELER  (!  THE  A0013  PERSON  ( ( FIRSTNAHB  BILL))))) 
SEHQUA NT  THE  NU  SG  HEAD  (N  TRIP)  SEHHEAD  EB/TRII) 

States  * T-  DET- SG-DEF/  T-SG-NON-DEH/  T-SG-POSS?/  T-SG/DET 
Scopedacts  = NIL 
Parent  SC  = 10 


Doinq  arc:  (JUHP  T-DET- SG-DEF/  --  (froa  T-DET-SG/  365)  (to 
T-DET-SG-DE F/  363)) 

Creatinq  SCONPIG  group: 

12:T-DET-SG/(0)  -TRIP/HEAD(3) 

Reqs  = (NHODS  ( (PP  (PREP  POR)  (NP  (NPR  (NAHE  (PIRST  BILL))))))  DET 
(DET  (ART  THE))  PERSON  (NPR  (NAHE  (PIRST  BILL)))  PERSONHOD  T DET  (DET 
THE)  SEHLINKS  ((TRAVELER  (!  THE  A0013  PERSON  ((IIRSTNAHE  BILL))))) 
SPHQUAN  T THE  NU  SG  HEAD  (N  TRTP)  SEHHEAD  DB/TRIF) 

states  = T-DET-SG/  T- DET- SG-DEP/  T-SG-NON-DEH/  1-SG-POSS?/  T-SG/DET 
Scopedacts  * NIL 
Parent  SC  = 11 
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Splittinq  the  UB!  ’eue:  (12) 

Extending  (J  B JOPENR IGHTQ:  (12) 

Creatinj  new  SCONPIG  qroup  at  different  level: 

11:  START/  (0)  -START/  (-  ) 

Rsqs  = NTL 
States  = NIT, 

Soopedacts  = NIL 
Parent  SC  = NIL 

Doinq  arc:  (JUNP  START/  — (froii  S/  155)  (to  S1AFT/  357)) 

Great,  inq  SCONPIG  qroup: 

14:  S/(0)  -START/(-) 

Reqs  = NIL 
States  - S/ 

Scopedacts  = NIL 
Parent  SC  = 1 1 

Doinq  left  proposals  for  Iconfiq  4 
found  0 arcs  reaching  Iconfig. 

Doinq  riqht  proposals  for  Iconfig  4 
found  Ifl  arcs  reachinq  Iconfig. 

Created  new  island: 

Island  confiq  4=0  BILL  1 -S  2 TRIP  3 
Left  spansflaq  = T 
Riqht  spansflaq  - NIL 
Parent  IC  = 3 


************** 
* * 

* WASHINGTON  * 

* * 
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Beginning  new  seed. 

Beqinninq  new  island  with  word  natch  (5  WASHI NGTCN  4 5 0) 


Doinq  arc:  (CAT  CITY  — (froa  CITY/  39)  (to  CI1Y/CITY  40)) 
addinq  to  left  of  SEBLINKS  to  qet  ((CITY  WASHINGTON) ) 
settinq  CITY  to  WASHINGTON 
settinq  LOCATION  to  T 

Creatinq  SCONPIG  qroup: 

15:  CITY /( 4)  -CITY/CITY  (5) 

Reqs  « (LOCATION  T CITY  WASHINGTON  SEBLINKS  ((CITY  WASHINGTON)  ) ) 
States  = CITY/ 

Scopedacts  = NIL 
Parent  SC  = NIL 

Doinq  arc:  (CAT  STATE  — (fro»  CITY/  39)  (to  CITY/POP  41)) 
settinq  LOCATION  to  T 
settinq  ANBIG  to  WASHINGTON 

addinq  to  left  of  SEBLINKS  to  qet  ((AHBIG  WASHINGTON)) 

Creatinq  SCONPIG  qroup: 

16  : CTTY / ( 4)  -C  ITT/POP  (5) 

Reqs  = (SEBLINKS  ((AHBIG  WASHINGTON))  ANBIG  WASEINGTON  LOCATION  ?) 
States  = CITY/ 

Scopedacts  = NIL 
Parent  SC  = NIL 

Doinq  arc:  (CAT  STATE  — (froa  CITY/CITY  40)  (tC  CITY/POP  41)) 
addinq  to  left  of  SEBLINKS  to  qet  ((STATE  WASHINGTON)) 
settinq  state  to  Washington 
Sawinq  scoped  action  6 for  Sconfiq  17 
with  scope  CITY/ 

Creatinq  SCONPIG  qroup: 

17:  CITY /CITY  (4)  -CITY/POP  (5) 

Reas  = (STATE  WASHINGTON  SEBLINKS  ((STATE  W ASHI IGTON) ) ) 

States  = CITY/CITY 
Scopedacts  = (6) 

Parent  SC  = NIL 

Doinq  left  proposals  for  Iconfiq  5 

found  19  arcs  reachiinq  Iconfiq. 

Doinq  riqht  proposals  for  Iconfiq  5 
found  50  arcs  reachinq  Iconfiq. 

Created  new  island: 

Island  confiq  5=4  WASHINGTON  5 
Left  spansflaq  = NIL 
Riqht  spansflaq  - NIL 
l Parent  IC  = NIL 

1 


• v.  ■ 


; <GBROWN>  APPENDIX  -TRACE,  SY  NT  AX  ; 4 


Non  20- Dec-76  6-.12P1  PA(iF  1:9 


************************************** 

* * 

* r bill  -s  TRIP  > TO  <-  WASHINGTON  * 

* * 
************************************** 

Island  collision  event,  adding  new  word  TO.  Processing  extends  longer  of  the  two 
islands  by  addira  the  new  word,  then  adding  the  words  from  the  shorter  island  one  at 

a time. 

Processinq  event  for  Iconfiq  4 with  word  match  |4  TO  14  0) 

Spanninq  flaqs  Left:  T Riqht:  NIL 


Extendinq  To  Do  Queue  from  Sc's:  (12) 

Extendinq  B ! Q:  (12) 

Creatinq  new  SCONFIG  qroup  at  different  level: 
18 : TR  IP -TO/  ( 3)  -TR  IP-T0/(3) 

Reqs  = NIL 
States  = NIL 
Scopedacts  = NIL 
Parent  SC  = NIL 

Doinq  consuainq  arcs  for  the  event. 


Doinq  arc:  ( WRD  (TO)  --  (from  TRIP-TO/  410)  (tc  TRIP/TO  427)) 

Creatinq  SCONPIG  qroup: 

19:TRTP-T0/  ( 1)  - TRIP/TO  (4) 

Reqs  = NIL 
States  = TRIP-TO/ 

Scopedacts  = NIL 
Parent  sc  = 18 

Doinq  left  proposals  for  Iconfiq  6 
found  3 arcs  reachinq  Iconfiq. 

Doinq  riqht  proposals  for  Iconfiq  6 
found  9 arcs  reachinq  Iconfiq. 

Created  new  island:  Island  configuration  created  as 

intermediate  step  of  island 

Island  confiq  6 = 0 BILL  1 -S  2 TRIP  3 TO  4 collision  processing. 

Left  spansflaq  = T 
Riqht  spansflaq  - NIL 
Parent  IC  = 4 

This  is  part  of  island  collision  processing. 

Processinq  event  for  Iconfiq  6 with  word  natch  (5  WASHINGTON  4 S 0) 
Spanninq  flaqs  Left:  T Riqht:  NT L 
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Extendinq  To  Do  Queue  from  Sc’s:  (19) 

Extendinq  B ! Q:  (19) 

Creatinq  new  SCONFIG  qrouo  at  different  level: 
20 : CT  TY/  (4)  -CITY/  (4) 

Reqs  = NIL 
States  = NIL 
Scopedacts  = nil 
Parent  SC  = NIL 

Doinq  consuming  arcs  for  the  event. 


Doinq  arc:  (CAT  CITY  --  (froe  CITY/  39)  (to  CI1Y/CITY  40)) 
addinq  to  left  of  SEMLINKS  to  get  ((CITY  WASHINGTON)) 
settinq  CITY  to  WASHINGTON 
setting  LOCATION  to  T 

Creatinq  SCONPIG  qroup: 

21  :CITY/(4)  -CITY/CITY  (5) 

Peqs  = (LOCATION  T CITY  WASHINGTON  SEMLINKS  ((CITY  WASHINGTON))) 
States  = CITY/ 

Scopedacts  - NIL 
Parent  SC  = 20 

Doinq  arc:  (CAT  STATE  --  (fro#  CITY/  39)  (to  CITY/POP  41)) 

settinq  LOCATION  to  T 
setting  AM 31 G to  WASHINGTON 

addinq  to  left  of  SEMLINKS  to  qet  ( ( AHBIG  WASHINGTON)) 

Creatinq  SCONPIG  group: 

22:  CITY/ (4)  -CITY/ POP  (5) 

Reqs  = (SEMLINKS  ((AHBIG  WASHINGTON))  AHBIG  WASHINGTON  LOCATION  T) 
States  = CITY/ 

Scopedacts  = NIL 
Parent  SC  = 20 

Doinq  left,  proposals  for  Tconfig  7 
found  0 arcs  reachinq  Iconfig. 

Doinq  riqht  proposals  for  Iconfig  7 
found  17  arcs  reachinq  Iconfig. 

Created  new  island: 

Island  confiq  1 = 0 BILL  1 -S  2 TRIP  3 TO  4 WASHINGTON  5 
Left  spansflaq  = T 
Riqht  spansflaq  - NIL 
Parent  IC  = 6 
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***************************************** 

♦ * 

* r BILL  -S  TRIP  TO  WASHINGTON  ->  COSTS  * 

* * 
***************************************** 

Note  that  this  event  requires  the  completion  of  3 lower  level  constituents. 


Processing  event  for  Iconfiq  7 with  word  latch  (6  COSTS  5 6 0) 
Spanning  flags  Left:  T Right:  NIL 


Extending  To  Do  Queue  froi  Sc's:  (22  21  ) 

Doing  arc:  (JUMP  CITY/POP  — (froi  CITY/CITY  40)  (to  CITY/POP  41)) 
Verify  failed  for  action.  (NOT  ( C ATCHECK  (WORD  (GETR  CITY))  (QUOTE 
STATE) ) ) 

in  arc:  (JUMP  CITY/POP  — (froi  CITY/CITY  40)  (to  CITY/POP  41)) 

Arc  failed. 

Splitting  the  CIQueue:  (22)  - 

Coipleting  constituents  for  the  CICOHPLETFO:  (2J) 

Doinq  arc:  (POP  --  (from  CITY/POP  41)) 

setting  INTERP  to  (!  THE  A0014  LOCATION  ((IHBIG  WASHINGTON))) 
setting  #LI  FT  LIS  T # to  (INTERP) 
lifting  register  INTERP. 

Saving  scoped  action  7 for  Sconfiq  23 
with  scope  POP 

Creating  SCONPIG  group: 

23:  CITY/  (4)  -CITY/POP  (5) 

Regs  * (VLIPTLIST#  (INTERP)  INTERP  (!  THE  A0014  LOCATION  ((AHBIG 
WASHINGTON)))  S EHLINKS  ( ( AH  BIG  W ASH  INGTON  ) ) AHBIG  WASHINGTON  LOCATION 
T) 

States  = CITY/POP  CITY/ 

Scopedacts  = (7) 

Parent  SC  = 22 

Creating  direct  connection  to  new  constituent  ftci: 

19:  TRIP-TO/  (3)  -TRIP/TO  (4) 

Regs  = NIL 
States  = TRIP-TO/ 

Scopedacts  = NIL 
Parent  SC  = 18 


Creatinq  constituent  from  Sconfig  23: 

(!  THE  A0014  LOCATION  ((AHBIG  WASHINGTON))) 
NPF  LOCATION  AHBIG  WASHINGTON 
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Doinq  arc:  (PUSH  CITY/  --  (froa  TRTP/TO  427)  (to  TFIP-TO/POP  411)) 

settinq  INTERP  to  (DESTINATION  (!  THE  A0014  LOCATION  ( (ANBIG 
WASHINGTON) ) ) ) 

settinq  PP  to  (PP  (PREP  TO)  (NP  (N  PR  ( LOCATION  ( AHBIG  WAS  HINGTON) ) ) ) ) 

Creatinq  SCONFIG  qroup: 

24:  TRIP-TO/  (I) -TR  IP-TO/POP  (6) 

Reqs  = (PP  (PP  (FREP  '"O)  (NP  (NPR  (LOCATION  (AH  EIG  WASHINGTON))))) 

INTERP  (DESTINATION  (!  THE  A0014  LOCATION  ((AHBIG  WASHINGTON))))) 

States  = TRIP/TO  TRIP-TO/ 

Scopedacts  = NIL 
Parent  SC  = 19 

Extendinq  To  Do  Oueue  froa  Sc's:  (24) 

Splittinq  the  CJQueue:  (24) 

Conpletinq  constituents  for  the  C 1C0HPL  BTEQ : (24) 

Doinq  arc:  (POP  — (from  TRIP-TO/POP  411)) 

settinq  ILIFTLISTI  to  (INTERP) 
lifting  reqist.er  INTERP. 
settinq  ALIFTLIST#  to  (SEHLINKS  INTERP) 
liftinq  reqist.er  SEHLINKS. 

Savinq  scoped  action  8 for  Sconfig  25 
with  scope  POP 

Creatinq  SCONFIG  group: 

25:  TR  IP-TO/  ( 3)  -TR  IP-TO/POP  (5) 

Reqs  = (ILIFTLIST#  (SEHLINKS  INTERP)  # L IFTLIST#  (INTERP)  PP  (PP  (PREP 
TO)  (NP  (NPR  (LOCATION  ( AH  BI G W A SHI  NGTON)  ) ) ) ) I STEEP  (DESTINATION 
( ! THE  A0014  LOCATION  ( (AHBIG  WASHINGTON))))) 

States  = TRIP-TO/POP  fRIP/TO  TRIP-TO/ 

Scopedacts  = (8) 

Parent  SC  = 24 

Creatinq  direct  connection  to  new  constituent  free: 

12:T-DET-SG/(0)  -TRIP/HEAD  (3) 

Reqs  = ( N HODS  ( (PP  (PREP  FOR)  (NP  (NPR  (NANI  (FIRST  BILL))))))  DET 
(DET  (ART  THE))  PERSON  (NPR  (NAHE  (FIRST  BILL)))  PERSONHOD  T DET  (DET 
THE)  SEHLINKS  ((TRAVELER  (!  THE  A0013  PERSON  ( ( f IBSTNAHE  BILL))))) 

SEHQUANT  THE  NU  SG  HEAD  (N  TRIP)  SEHHEAD  CB/TRIP) 

States  = T-DET-SG/  T-DET- SG-DEE/  T- SG-NON-DEH/  T-SG-POSS?/  T-SG/DET 
Scopedacts  = NIL 
Parent  SC  = 11 


Creatinq  constituent  from  Sconfig  25: 

(DESTINATION  (!  THE  A0014  LOCATION  ((AHBIG  WASHINGTON)))) 
PP  PREP  TO 

NP  NPR  LOCATION  AHBIG  WASHINGTON 
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Doing  arc:  (PUSH  TRIP-TO/  — (from  TRIP/HEAD  4 20)  (to  TRIP/TO-HOD  429)) 
adding  to  left  of  SEHLINKS  to  get  ((DESTINATION  (!  THE  A0014 
LOCATION  ( ( AfIBIG  WASHINGTON))))  (TRAVELER  {!  TH  ! A0F13  PERSON  (( 
FIRSTNAME  B ILL)  ) ) ) ) 

addinq  to  Left  of  NMODS  to  get  { (PP  (PREP  TO)  (NP  (NPR  (LOCATION 
(AH  BIG  WASHINGTON)))))  (PP  (PREP  POR)  (NP  (NPR  (NAHF  (PIRST  BILL)))))) 

Creatinq  SCONPIG  qroup: 

26  : T-DRT-SG/  (0)  -TRI  P/TO-MOD  (6) 

Reqs  = (NMODS  ( (PP  (PREP  TO)  (NP  (NPR  (LOCATION  (AHBIG  WASHINGTON))))) 
(PP  (PREP  POR)  (NP  (NPR  (NAME  (FIRST  BILL))))))  SEHI.INKS  ((DESTINATION 
(!  THE  A0014  LOCATION  ( ( AHBI G W AS  HINGTON)  ) ) ) (TRAVELER  (t  THE  A0013 
PERSON  ( ( FIR  STN  AH  E BILL)))))  NHODS  ((PP  (PREPPCF)  (NP  (NPR  (NAHE 
(FIRST  RILL))))))  DET  (DET  (ART  THE))  PERSON  (N IF  (NAHE  (PIRST  EILL))) 
PEPSONHOD  T DET  (DET  THE)  S EHLINKS  ((TRAVELER  (!  THE  A0013  PERSON 
((FIRSTNAME  BILL)))))  SEHQU ANT  THE  NU  SG  HEAD  (A  TRIP)  SEHHEAD  DB/TRIP) 
States  = TRIP/HEkD  T-DET-SG/  T-DET-SG- DEF/  T-SG-NON-DEH/  T-SG-POSS?/ 
T-SG/DET 
Scopedacts  = NIL 
Parent  SC  = 12 

Extending  To  Do  Queue  froa  Sc*s:  (26) 

Doing  arc:  (JUHP  TR I P/TO- FOR- HOD  --  (froa  TRIP/TO-HOD  429)  (to 
TPI P/TO-FOR-HOD  4 2 B)  ) 

Creating  SCONFIG  group: 

27 : T-  DET-  SG/  (0)  -TRIP/TO-TOR  HOD  (5) 

Regs  = (NHODS  ( (PP  (FREP  TO)  (NP  (NPR  (LOCATION  (AH  B IG  W A SHI  NG  TON ) ) ) ) ) 
(PP  (PREP  POR)  (NP  (NPR  (NAME  (FIRST  RILI))))))  SEHLINKS  ((DESTINATION 
( ! THE  A0014  LOCATION  ( ( AHBTG  WASHINGTON))))  (TFAVELFR  (!  THE  A0013 
PEPSON  ((PIRSTNANE  BILL)))))  NHODS  ( (PP  (PREP  FCR)  (NP  (NPR  (NAHE 
(FIRST  BILL,)))))  DET  (DET  (ART  THE))  PERSON  (NEF  (NAHE  ( FT3ST  BILL))) 
PERSONHOD  T DET  (DET  THE)  SEHLINKS  ((TRAVELER  (!  THE  A0013  PERSON 
((FIRSTNANE  BILL)))))  SENQUANT  THE  NU  SG  HEAD  (A  TRIP)  SEHHEAD  DB/TRIP) 
states  = TRIP/TO-HOD  TRTP/HEAD  T-DET-SG/  T- DET- SG-DEF/  T-SG-NON-DEH/ 
T-SG-POSS?/  T-SG/DET 
Scopedacts  = NIL 
Parent  SC  = 26 

Doing  arc:  (JUHP  TRIP/DATE?  — (from  TRI P/TO-FCR-HOD  428)  (to 

TRIP/DATE?  416)  ) 

Creatinq  SCONFIG  group: 

28 : T- DET- SG/(0) -TRIP/DATE?  (5) 

Reqs  = (NMODS  ( (PP  (PREP  TO)  (NP  (NPR  (LOCATION  (AHBIG  WASHINGTON))))) 
(PP  (PREP  FOR)  (NP  (NPR  (NAME  (FIRST  BILL))))))  SEHLINKS  ( (DESTINATION 
(!  THE  A 00 1 4 LOCATION  ((AHBIG  WASHINGTON))))  (TFAVELFR  (!  THE  A0013 
PERSON  ((FIRSTNAME  BILL)))))  NHODS  ( (PP  (PREP  FCR)  (NP  (NPR  (NAHE 
(FIRST  BILL))))))  DET  (DET  (ART  THE))  PERSON  (NFR  (NAHE  (FIRST  BILL))) 
PERSONMOD  T DET  (DET  THE)  SEHI.INKS  ((TRAVELER  (!  THE  A0013  PERSON 
((FIRSTNAME  BILL)))))  S EMDU  ANT  THE  NU  SG  HEAD  ( K TRIP)  SEHHEAD  DB/TRIP) 
States  = TR I P/TO-FOR-HOD  TRIP/TO-HOD  TRIP/HEAD  1-DET-SG/  T-DET-SG-DEP/ 
T-SG-NON-DEH/  T-SG-POSS?/  T-SG/DET 
Scoppdacts  = NIL 
Parent  SC  = 27 
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Doinq  arc:  (JUHP  TRIP/POP  — (fro*  TBIP/DATE?  416)  (to  TRIP/POP  425)) 

Creatinq  SCONFIG  qroup: 

29 : T- DET- SG/  (0) -TR I P/POP  (5) 

Reqs  = ( NHODS  ( (PP  (PREP  TO)  (NP  (NPR  (LOCATION  {AHBIG  WASHINGTON))))) 
(PP  (PREP  POR)  (NP  (NPR  (NAHE  (FIRST  BILL))))))  SEHLINKS  ((DESTINATION 
(1  THE  A0014  LOCATION  ((AHBIG  WASHINGTON))))  (TFAVELER  (!  THE  A0013 
PERSON  ((PIRSTNAHE  BILL)))))  NHODS  ( (PP  (PREP  PCR)  (NP  (NPR  (NAHE 
(PIRST  BILL))))))  DET  (DET  (ART  THE))  PERSON  (NFR  (NAHE  (FIRST  BILL))) 
PERSONHOD  T DET  (DET  THE)  SEHLINKS  ( (TRAVELER  (S  THE  A0013  PERSON 
((PIRSTNAHE  BILL)))))  S EM  Oil  ANT  THE  NU  SG  HEAD  ( N TRIP)  SERHEAD  DB/TRIP) 
States  = TRIP/DATE?  TSIP/TO-POR-HOD  TRIP/TO-HOD  TRIP/HEAD  T-DET-SG/ 
T-DET-SG-DEP/  T-SG-NO N-DEH/  T-SG-POSS?/  T-SG/DET 
Scopedacts  = NIL 
Parent  SC  = 2B 

Splittinq  the  CIQueue:  (29) 

Couplet  inq  constituents  for  the  C1COHPLPTEQ:  (25) 

Doinq  arc:  (POP  --  (from  TRIP/POP  425)) 

Savinq  scoped  action  9 for  Sconfiq  30 
with  scope  T 

Savinq  scoped  action  10  for  Sconfiq  30 
with  scope  POP 

Creatinq  SCONFIG  qroup: 

30  ; T - DET-  SG/  ( 0)  -TRIP/POP  (5) 

Reqs  = (NHODS  ( (PP  (PREP  TO)  (NP  (NPR  (LOCATION  (AHBIG  WASHINGTON))))) 
(PP  (PREP  FOR)  (NP  (NPR  (NAHE  (FIRST  BI LL) ) ) ) ) ) SEHLINKS  ((DESTINATION 
( ! THE  A0014  LOCATION  ((AHBIG  WASHINGTON))))  (TFAVELER  (!  THE  A0013 
PERSON  ((FIRSTNAHE  BILL)))))  NHODS  ( (PP  (PREP  FCR)  (NP  (NPR  (NAHE 
(FIRST  BILL))))))  DET  (DET  (ART  THE))  PERSON  (NIR  (NAHE  (FIRST  BILL))) 
PERSONHOD  T DET  (DET  THE)  SEHLINKS  ((TRAVELER  (!  THE  A0013  PERSON 
((PIRSTNAHE  BILL)))))  SEHQUANT  THE  NO  SG  HEAD  ( H TRIP)  SEHHEAD  DB/TRIP) 
States  = TRIP/POP  TRIP/DATE?  TRI P/TO-FOB-BOD  TBIF/TO-HOD  TRIP/HEAD 
T-DET-SG/  T-DET-SG-  DEF/  T-SG-NON-DEH/  T-SG-POSS?/  T-SG/DET 
Scopedacts  = (9  10) 

Parent  SC  = 29 

Doinq  scoped  action  9 for  SC  30 

in  arc  (POP  --  (fro*  TRIP/POP  425)) 

settinq  SEHLINKS  to  ((TRAVELER  (!  THE  A0013  PERSON  ((PIRSTNAHE 
BILL))))  (DESTINATION  (!  THE  A0014  LOCATION  ((APEIG  WASHINGTON))))) 
settinq  INTERP  to  (!  THE  A0015  DB/TRIP  ((TRAVELER  (!  THE  A0013 
PERSON  ((PIRSTNAHE  BILL))))  (DESTINATION  (!  THE  A0014  LOCATION  ((AHBIG 
WASHINGTON)  ) ) ) ) ) 

settinq  ALIFTLIST#  to  (INTERP) 
liftinq  reqister  INTEFP- 

Creatinq  direct  connection  to  new  constituent  fees: 

14:  S/(0)  -START/  (-) 

Reqs  = NIL 
States  = S/ 

Scopedacts  = NIL  -l’l- 

P<?  rent  SC  = 1 3 
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Creatinq  constituent 


from  Sconf 


iq  30: 


Semantic  representation  of 
"BILL'S  TRIP  TO  WASHINGTON". 


( ! THE  A0015  DB/TRIP  ((TRAVELER  (!  THE  A001  3 PEfSCN  ((PIRSTNAHE  BILL))) 
(DESTINATION  (!  THE  A0014  LOCATION  ((AHBIG  WASHINGTON)))))) 


) 


NP  i) E T ART  THE 

N TRIP  „ 

PP  PHEP  FOB  -Syntactic  tree  for 

NP  NP R NAME  P IP  ST  BILL  ' BIU-  S ™IP  ™ WASHINGTON". 

PP  P PEP  TO 

NP  VP  R LOCATION  AHBIG  WASHINGTON 
FEATS  NU  SG 


Doinq  arc:  (PUSH  T-DET-SG/  — (froa  START/  357)  (to  S/DCL- SUB  J- SG  203)) 
settinq  SEMQUANT  to  THE 
settinq  SEHVAR  to  A0015 
settinq  SENHEAD  to  DB/TRIP 

addinq  to  left  of  S ENL  INKS  to  qet  ((TRAVELIR  (!  THE  A0013  PERSON 
((PIRSTNAME  BILL) ) ) ) ) 

addinq  to  left  of  SEHLINKS  to  qet  ((DESTINATION  (!  THE  A0014 
LOCATION  ((AHBIG  WASHINGTON))))  (TRAVELER  (!  TH  f A0013  PERSON  (( 
PIRSTNAME  BILL)  ) ) ) ) 

settinq  TYPE  to  DCL 

settinq  SUBJ  to  (NP  (DET  (ART  THE))  (N  TRI  ()  (PP  (PREP  POR)  (N  P 
(NPR  (NAME  (FIRST  BILL)))))  (PP  (PREP  TO)  (NP  ( A FR  (LOCATION  (AHBIG 
WASHINGTON)))))  (FEATS  (NU  SG)  ) ) 

Creatinq  SCONPIG  qroup: 

31  : S/(0)  -S/DCL-SUBJ-SG(6) 

Feqs  = (SUBJ  (NP  (DET  (ART  THE))  (N  TRIP)  ( PP  (PREP  FOR)  (NP  (NPR 
(N  AM  F (FIRST  BILL) ) ) ) ) (PP  (PREP  TO)  (NP  (NPR  (ICCATION  (AHBIG 
WASHINGTON)))))  (PEATS  (NU  SG)  ) ) TYPE  DCL  SEHLINKS  ((DESTINATION  (! 

THE  A0014  LOCATION  ((AHBIG  WASHINGTON))))  (TFAV  HEP  (!  THE  A0013  PERSON 
((PIRSTNAME  BILL)))))  SEHLINKS  ((TRAVELER  (!  TH  I A0013  PERSON  (( 
PIRSTNAME  BILL)))))  S EMHEAD  DB/TRIP  SEHVAR  A001F  SEMQUANT  THE) 

States  = START/  S/ 

Scopedacts  = NIL 
Parent.  SC  = 14 

Extendi nq  To  Do  Queue  froa  Sc's:  (31) 

Doinq  consurainq  arcs  for  the  event. 
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Doinq  arc:  (WRD  (IS  WAS  COSTS  COST-PAST)  --  (f  ro«  S/DCL-SU  BJ-SG  20  3 

) (to  S/DCL-IS  200) ) 

settinq  V to  COST 
settinq  HEAD  to  COST 
settinq  TNS  to  PRESENT 
settinq  VOICE  to  ACTIVE 

addinq  to  left  of  SEHLINKS  to  get  ((TIHE  (AFTER  HOW))  (DESTINATION 
( ! THE  A0014  LOCATION  ( (AHBIG  WASHINGTON))))  (TFAVEtER  (!  THE  A0013 
PERSON  ( (PI RSTNAHE  BILL))))) 

Creatinq  SCONPIG  group: 

32:  S/(0)  -S/DCL-IS  (6) 

Reqs  * (SEfIL  INKS  ( (TINE  (AFTER  NOW))  (DESTINATION  (!  THE  A0014  LOCATION 
( ( AH  BIG  WASHINGTON))))  (TRAVELER  (!  THE  A0013  PIRSON  ((FIRSTNAHE  BILL))))) 
VOICE  ACTIVE  TNS  PRESENT  HEAD  COST  V COST  SUBJ  (NP  (DET  (ART  THE)) 

(N  TRIP)  (PP  (PREP  FOR)  (NP  (NPR  (NAHE  (FIRST  Bill)))))  (PP  (PREP 
TO)  (NP  (NPR  (LOCATION  ( AH  BIG  WASHINGTON)))))  (FEATS  (NO  SG) ) ) TYPE 
DC  L SEHLINKS  ((DESTINATION  (!  THE  A0014  LOCATION  ( ( AHBIG  WASHINGTON)))) 
(TRAVELER  (!  THE  A0013  PERSON  ((FIRSTNAHE  BILL)))))  SEHLINKS  ((TRAVELER 
( ! THE  A0013  PERSON  ((FIRSTNAHE  BILL)))))  S EH  HE  AD  DB/TRIP  SBHVAR  A0015 
SEHQUANT  THE) 

states  = S/DCL-SUBJ-SO  START/  S/ 

Scopedacts  = NIL 
Parent  SC  = 31 

Doinq  left  proposals  for  Iconfiq  8 
found  0 arcs  reachinq  Iconfiq. 

Doing  riqht  proposals  for  Iconfiq  8 
found  8 arcs  reachinq  Iconfiq. 


Created  new  island: 

Island  confiq  8 = 0 BILL  1 -S  2 TRIP  3 TO  4 WAS  FI NGTON  5 COSTS  6 
Left  spansflaq  = T 
Riqht  spansflaq  - NIL 
Parent  IC  = 7 


************************************************ 
* * 

* f BILL  -S  TRIP  TO  WASHINGTON  COSTS  ->  NINETY  * 

* * 
**********  *******************************  ******* 
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(K  (!  THE  A0014  LOCATION 
|R  SON  ( (FIRSTNA  HE  BILL))))) 
(»P  (DET  (ART  THE)) 

ILL)))))  (PP  (PRBP 
FEATS  (NU  SG)))  TYPE 
iK  ((AHBIG  WASHINGTON)))) 
))))  SEHLINKS  ((TRAVELER 
5 |E  DB/TRIP  SBNVAR  A0015 


u.nt-  fnr  T con  f ill  H with  word  match  (7  NINETY  f>  7 0) 
Procpssmii  event  for  iconrij 

Spa  nn  in  q flags  Left:  T Fiqht:  NIL 
Extending  To  Do  Queue  from  Sc's:  (32) 

poinn  arc:  (.10"  S/DCL-IS-ADV  - (from  S/DCL-I  S 200)  (to  S/DCL-IS-ADV 

20  1 ) ) 

Creating  SC0NF1G  group: 

33  : S/ (0 ) - S/DCL- IS- ADV  (6)  (DESTI  NATI  (N  (!  THE  A0014  LOCATIOI 

Regs  = (SEHLINKS  CJTIBB  ("TER  >O0)' ) (Dl KJJj«Tp  ((FIRSTNAHE  BILL)! 

((AHBIG  WASHINGTON))))  J*  ¥ COST  SOM  (HP  (DET  (ART  THE)) 

VOTCF  ACTIVE  TNS  PRESENT  HEAD  COST  V COSTAS  ) ) ) ) (Pp  V™* 

(N  TRIP)  (PP  (pSEP  F0^  #J!oTr  WASHINGTON)))))  ( FEATS  (NU  SG)  ) ) TYPE 
TO)  (NP  (NPR  (LOCATION  (AHBIG  M ASHI NGTON)  ))  )>  ^ (AnBIG  WASHINGTON))) 

Del  SEHLINKS  ( ( BEST: INATION  ( 1 J,  }f  SEHLINKS  ( (TRAVELE 

(JRTHFLA0013  PERSON^  ((FIRSTNAHE  BILL)))))  SEHHEIC  DE/TRIP  SBHVAR  A0015 

S/DCl-IS  S/DCt-SUBJ-SO  START/  S/ 

Scopedacts  = NIL 
Parent  SC  = 32 

Extendinq  B?Q:  (3  3) 

Creatinq  nes  SCONPIG  group  at  different  level: 

3U:  NU H/  (b)-NUN/(6) 

Reqs  = NIL 
States  = NIL 
Scopedacts  = NIL 
Parent  SC  = NIL 

ooinq  consuming  arcs  for  the  event. 

poinq  arc:  (CAT  TT»s  - (fro.  MR/  126,  (to  NUr/,  12T» 
settinq  NU  H to  90 

Creatinq  SCONFIG  qroup: 

35: NUN/  (6)  -NON/1  (7) 

Raqs  = (NUtl  90) 

States  * NU  H/ 

Scopedacts  s NIL 
Parent  SC  - 34 


Doinq  left  proposals  for 

found  0 arcs  reachinq  Iconfiq. 

Doinq  right  proposals  for  ^onfig  9 
found  5 arcs  reachinq  Iconfiq. 

Created  nev  island: 
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Island  config  9=0  BILL  1 -S  2 TRIP  3 TO  4 BAS  FIHGTON  5 COSTS  6 NI HETI 
7 

Left  spansflaq  = T 
Right  spansflaq  - MIL 
Parent  IC  = 8 


00 00 000 0*0*000 ****00**0*00 ******** ***¥**  ******  *********** 
♦ * 

* r BILL  -S  TRIP  TO  BASHINGTOH  COSTS  HIM  ITT  ->  DOLLAR-S  * 

♦ * 
*************** ***00**00************** ******************* 


Processing  event  for  Iconfiq  9 with  word  aatch  (8  DOLLAR-S  7 8 0) 
Spanning  flags  Left:  T Right:  MIL 


Extending  To  Do  Queue  frow  Sc*s:  (35) 

Doinq  arc:  (JUMP  NUN/NUH  — (fro«  NUH/1  127)  (to  MUH/NOH  128)) 

Creatinq  SCON'IG  group: 

36:  NUH/(6)  -Nrn/NUH(7) 

Regs  = (NUM  90) 

States  = NUH/1  NUM/ 

Scopedacts  = NIL 
Parent  SC  = 35 

Splitting  the  CJQueue:  (36) 

Completing  constituents  for  the  C1C0HPLETEQ:  (3f) 

Doing  arc:  (POP  — (frora  NUH/NUH  128)) 

Saving  scoped  action  11  for  Sconfig  37 
with  scope  POP 

Creatinq  SCONPIG  group: 

37:  NUN  / (6)  - N UH/NUH  ( 7) 

Pegs  = (NUH  90) 

States  = NUH/NUH  NUH/1  NUH/ 

Scopedacts  = (11) 

Parent  SC  = 36 

Creating  SCONPIG  group  at  new  level  for  complete  constituent 
38:  NUMBER/ (7) - NUMBER/  (7) 

Regs  = NIL 
States  = NIL 
Scopedacts  = NIL 
Parent  SC  = NIL 
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Creatinq  constituent  from  Sconfig  37: 

^ Note  that  syntactic  representation  of  this 

constituent  is  sufficient  here. 

90 


Doinq  arc:  (PUSH  NUN/  --  (fron  NUMBER/  129)  (tc  HU MBER/NUM  133)) 
settinq  NUM  to  90 

Creatinq  SCONFIG  qroup: 

39:  NUMBER/  (7)  - NUMBER/  MUM  (8) 

Reqs  = (NUP1  90) 

States  = NUMBER/ 

Scopedacts  = NIL 
Parent  SC  = 38 

Extendinq  To  Do  Queue  fron  Sc's:  (39) 

Splittinq  the  CJQueue:  (39) 

Completinq  constituents  for  the  C'.COMPLETEQ:  (39) 

Doing  arc:  (POP  --  (from  NUMBER/NUM  133)) 

Savinq  scoped  action  12  for  Sconfig  40 
with  scope  POP 

Creatinq  SCONFIG  group: 

40:  NUMBER/ (7)  -NU  M BEF/ NU  H (7 ) 

Reqs  = (HUM  90) 

States  = NUMBER/NUM  NUMBER/ 

Scopedacts  - (12) 

Parent  SC  = 39 

Creatinq  SCONFIG  group  at  new  level  for  complete  constituent 
41  : AMOUNT/ (7) -AMOUNT/  (7) 

Raqs  = NIL 
States  = NIL 
Scopedacts  = NIL 
Parent  SC  = NIL 


Doing  arc:  (JUMP  AMOUNT/MOD  --  (from  AMOUNT/  1)  (to  AMOUNT/HOD  6)) 

Creatinq  SCONFIG  group: 

42:  AMOU NT/ (7) - ANOU NT/ MOD (7 ) 

Regs  = N IL 
States  = AMOUNT/ 

Scopedacts  = NIL 
Parent  SC  = 41 

Creatinq  constituent  from  Sconfiq  4 0: 

NIL 
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Doing  arc:  (PUSH  NUHBER/  — (fro»  AHOUNT/HOD  6)  (to  A HOUNT/NH H 7)) 
setting  HUH  to  90 

Creatinq  SCONPIG  qroup: 

43: AHOUNT/(7)  -ANOUNT/MJN  (8) 

Regs  = (NUH  90) 

States  = AHOONT/HOD  AMOUNT/ 

Scopedacts  = NIL 
Parent  SC  = 42 

Extending  To  Do  Queue  froa  Sc's:  (43) 

Doing  consuninq  arcs  for  the  event. 


Doing  arc:  (WRD  (DOLLAR  jOLLAR-S  BUCKS)  — (fret  AHOUNT/NUH  7)  (to 

AHOHNT/DOLLARS  5)) 

setting  NO  to  NIL 

Creating  SCONPIG  qroup: 

44: ANOUNT/(7) -ANOUNT/DOLLARS  (8) 

Pegs  = (NU  NIL  NUN  90) 

States  = AHOUNT/NUH  AHOUNT/HOD  AMOUNT/ 

Scopedacts  = NIL 
Parent  SC  = 4? 

Doinq  left  proposals  for  Iconfiq  10 
found  0 arcs  reaching  Iconfiq. 

Doing  right  proposals  for  Iconfiq  10 
found  3 arcs  reaching  Iconfiq. 

Created  new  island: 

Island  config  10  = 0 BILL  1 -S  2 TRIP  3 TC  4 WASHINGTON  S COSTS  6 NINETY 
7 DOLLAR-S  8 
Left  spansflaq  = T 
Riqht  spansflaq  - NIL 
Parent  IC  = 9 


*****************************************  4***************** 
* * 

♦ r BILL  -S  TRIP  TO  WASHINGTON  COSTS  NINETY  DOLLAR- S ->  ] * 

* * 
*****************<■***************  ********  *******  *********** 

Doing  right  end  event. 
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Trying  to  connect  to  riqht  end  of  sentence, 
extendinq  To  Do  Queue  from  Sc's:  (44) 

Doinq  arc:  (JUMP  AMOUNT/POP  — (fro»  A MOUNT/DO I LARS  5)  (to  AMOUNT/POP  8 

)) 

Creatinq  SCONFIG  qroup: 

45:  AMOUNT/ ( 7)  - AMOUNT/POP  (8) 

Reqs  = (HU  NIL  Nil  M 90) 

States  - AMOUNT /DOLLARS  AMOUNT/NDH  AHOUNT/HOD  AMOUNT/ 

Scopedacts  = NIL 
Parent  SC  = 44 

Splittinq  the  CIQueue:  (45) 

Completing  constituents  for  the  C1COMPLETEQ:  (45) 

Doinq  arc:  (POP  — (from  AMOUNT/POP  8)) 
setting  AMT  to  90 
settinq  4LIFTLIST#  to  (AMT) 
iiftinq  reqister  AMT. 
settinq  #LIFTLIST#  to  (MOD  AMT) 

Iiftinq  reqister  MOD. 

Savinq  scoped  action  13  for  Sconfig  46 
with  scope  pop 

Creatinq  SCONFIG  qroup: 

46: A MOUNT/ (7) -AMOUNT/POP (8) 

Pegs  = ( * LIFTLIST#  (MOD  AMT)  tLTFTLIST#  (AMT)  AFT  90  NU  NIL  NUN  90) 

States  = AMOUNT/POP  AMOUNT/ DOLL  ARS  AHOU  NT/NUH  A FCUNT/HOD  AMOUNT/ 
Scopedacts  = (13) 

Parent  SC  = 45 

Creatinq  direct,  connection  to  new  constituent  frcm: 

33  : S / (0 ) - S/DCL-  IS-  A DV  (6) 

R?qs  = (SEMLINKS  ((TIME  (AFTER  NON))  (DESTINATI CN  {!  THE  A0014  LOCATION 
((AMBIG  WASHINGTON))))  (TRAVELER  (!  THE  A00  1 3 P !RSON  ((PIPSTNAME  BILL))))) 
VOICE  ACTIVE  TNS  PRESENT  HEAD  COST  V COST  SUBJ  (HP  (DET  (ART  THE)) 

(N  TRIP)  (PP  (PRFP  FOR)  (NP  (NPR  (NAME  (FIRST  Bill)))))  (PP  (PREP 
TO)  (NP  (NPR  (LOCATION  ( AM  BI G W AS  HINGTO  N)  ) ) ) ) ( fEATS  (Nil  SG  ) ) ) TYPE 
DCL  SFMLINKS  ((DESTINATION  (!  THE  A0014  LOCATIOF  ((AMBIG  WASHINGTON)))) 
(TRAVFLFR  (!  THE  A0013  PERSON  ((FIRSTNAHE  BILL)))))  SEMLTNKS  ((TRAVELER 
(!  THE  A001  3 PERSON  ((PIRSTNAME  BILL)))))  SEMHEAD  DB/TRIP  SEUVAR  A0015 
SFMQII  AN  T THE) 

States  = S/DCL-IS  S/DCL-SU BJ-SG  START/  S/ 

Scopedacts  = NIL 
Parent  SC  = 32 


Creatinq  constituent  from  Scorfig  46: 
NIL 
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t 90 


Doinq  arc:  (PUSH  ANOUNT/  — (fron  S/DCL-IS-ADV  201)  (to  S/POP  276)) 
settinq  ACTION  to  (PROGN  (ADD:  ***  V ALUE  9t)  (ADD:  ***  UNITS 
DOLLARS) ) 

settinq  OBJ  to  (NP  (S  90)) 

Croatinq  SCONFIG  qroup: 

47:S/(0)-S/POP(-) 

Reqs  = (OBJ  (NP  ($  90))  ACTION  (PROGN  (ADC:  ***  VALUE  90)  (ADD:  *** 
UNITS  DOLLARS))  SEHLINKS  ( (TIDE  (AFTER  NOW)  ) (D I SU  NATION  (!  THE  A0014 
LOCATION  ((AK3IG  WASHINGTON))))  (TRAVELER  (!  TI!  I A0013  PERSON  (( 
FIRSTNAHE  BILL)))))  VOICE  ACTIVE  TNS  PRESENT  HET.D  COST  V COST  50BJ 
(NP  (DET  (ART  THE))  (N  TRIP)  (PP  (PREP  FOB)  (NP  (NPR  (NAHE  (PIRST 
BILL)))))  (PP  (PREP  TO)  (NP  (NPR  (LOCATION  (ANBIG  WASHINGTON))))) 

(FEATS  (NU  SG)))  TYPE  DCL  SEHLINKS  ((DESTINATION  (!  THE  A0014  LOCATION 
( ( AN  BIG  WASHINGTON))))  (TRAVELER  (!  THE  A0013  PIBSON  ((FIRSTNAHE  BILL))) 
SEHLINKS  ((TRAVELER  (!  THE  A0013  PERSON  ((FIRSTNAHE  BILL)))))  SEHHEAD 
DB/TRIP  SENVAR  A0015  SENQUA NT  THE) 

States  = S/DCL-IS-ADV  S/DCL-IS  S/DCL-SUBJ-SG  ST  # BT/  S/ 

Scopedacts  = NIL 
Parent  SC  = 3 3 

Extendinq  To  Do  Queue  fron  Sc's:  (47) 

Doinq  arc:  (JUHP  END/  --  (froa  S/POP  276)  (to  IND/  78)) 

settinq  SEHLINKS  to  ((TRAVELER  (!  THE  A0013  PERSON  ((FIRSTNAHE 
BILL))))  (TI  HE  ( APTE3  NOW))  (DESTINATION  (!  THE  A0014  LOCATION  ((AH  RIG 
WASHI  NG TON)  ) ) ) ) 

settinq  INTERP  to  (!  THE  A0J15  DB/TRIP  ((TFAVELER  (!  THE  A0013 
PERSON  ((FIRSTNAHE  BILL))))  (TIHE  (AFTER  NOW))  (DESTINATION  (!  THE 
A0014  LOCATION  ((ANBIG  WASHINGTON)))))  NIL  NIL) 

settinq  INTERP  to  (FOR:  THE  A0013  / (FINDQ:  PERSON  (FIRSTNAHE 
BILL))  : T ; (FOR:  THE  A0014  / (FINDQ:  LOCATION  (ANBIG  WASHINGTON)) 

: T ; (FOP:  THE  A001S  / (FINDQ:  DB/TRIP  (DESTINATION  A0014)  (TIHE 
(AFTER  NO  ft)  ) (TRAVELER  A001  3)  ) : T ; ( PRCGN  (ADI:  A0015  VALUE  90) 

(ADD:  A001S  UNITS  DOLLARS))))) 

settinq  POPVAL  to  (S  DCL  (SUBJ  (NP  (DEi  (AFT  THE)'  (N  TRIP)  {PP 
(PREP  FOR)  (NP  (NPR  (NAHE  (FIRST  BILL)))))  (PP  (PREP  TO)  (NP  ( NPR 
(LOCATION  (ANBIG  WASHINGTON)  )))  ) (FEATS  (NU  SG))))  (AUX  (TNS  PRESENT) 
(VOICE  ACTIVE))  (VP  (V  COST)  (3BJ  (NP  ($  90))))) 
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Creatinq  SCONPIG  qroup: 

48:  S/ (t* ) -END/ (8) 

Reqs  = (POP  V AL  (S  DC  L (SUBJ  ( NP  (DET  ’ A RT  THE))  (N  TRIP)  (PP  (PREP 
FOR)  (NP  (NPR  (NAME  (FIRST  BILL)'*))  (PP  (PEEP  TO)  (NP  (NPH  (LOCATION 
(A1BIG  WASHINGTON)))))  (FEATS  (HD  SG)  ) ) ) (AUX  (INS  PRESENT)  (VOICE 
ACTIVE))  (VP  (V  COST)  (OBJ  (NP  (S  90)))))  INTER!  (FCR : THE  A0013  / 

('.  yhq:  person  (FIRSTNAME  BILL))  : T ; (POR:  TH  ! A0014  / (PTNDQ: 

LO  TON  (AMBIG  WASHINGTON))  : T ; (FOR:  THE  A0f15  / (FIN  DO : DB/TRIP 
(Dl  NATION  A0014)  (TIME  (AFTER  NO *)  ) (TRAVELER  A00  1 3)  ) : T ; (PROGN 

(ADD:  ; 00  IS  VALUE  90)  (ADD:  A001S  UNITS  DOLLARS)))))  INTERP  (!  THE 
A 0 0 1 5 DB/TRIP  ((TRAVELER  (!  THE  A0013  PERSON  ( ( I IPSTN  AHE  BILL)))) 

(TINE  (AF^FP  NOW))  (DESTINATION  (!  THE  A0914  LOCATION  ((AMBIG  WASHINGTON)))) 

**) 

NIL  NIL)  SEMLIH  S ( (TRAVELER  (!  THE  A0013  PERSOF  ((FIRSTNAME  BILL)))) 

(TIME  (APTEP  NOw|  ) (DESTINATTON  (!  THE  A0014  LOCATION  ((AMBIG  WASHINGTON)))) 

♦*) 

CBJ  (NP  (S  90))  ACTION  (PROGN  (ADD:  ***  VALUE  90)  (ADD:  ***  UNITS 
DOLLARS))  SEMLINKS  ((TIME  (APTER  NOW))  (DESTINATTON  (!  THE  A0014 
LOCATION  ((AMBIG  WASHINGTON))))  (TRAVELER  (!  THE  A0013  PERSON  (( 

FIRSTNAME  BILL)))))  VOICE  ACTIVE  TN  S PRESENT  HEAD  COST  V COST  SUBJ 
(NP  (DET  (APT  THE)  ) ( N TRIP)  (PP  (PREP  FOR)  (NP  (NPR  (NAME  (FIRST 
BILL)))))  (PP  (PREP  TO)  (NP  (NPR  (LOCATION  (AMBIG  WASHINGTON))))) 

(FEATS  (NU  SG)))  TYPE  DC  L SEMLINKS  ((DESTINATION  (!  THE  f.0014  LOCATION 
((AMBIG  WASHINGTON))))  (TRAVELS*  (!  THE  A0013  PERSON  {(FIRSTNAME  BILL))))) 
SFMLINKS  ((TRAVELER  (!  THE  A0013  PERSON  ((FIRSTRAME  BILL)))))  SEMHEAD 
DB/TRIF  SEHVAR  A0015  SEMQUANT  THE) 

States  = S/POP  S/DCL-IS-ADV  S/DCL-IS  S/  CCL-  SIJ  BJ  - SG  START/  S/ 

Scopedacts  = NIL 
Parent  SC  = 47 

Doinq  consuminq  arcs  for  the  event. 


Doinq  arc:  (POP  --  (from  END/  78)) 

Savinq  scoped  action  14  for  Sconfiq  49 
with  scope  POP 


Creatinq  SCPNPIG  qroup: 

49:  S/(P)  - END/ ( -) 

FOR)  (NP  (NPR  (NAME  (FIRST  BILL))))*  (PP  (PREP  TO)  (NP  (NPR  (LOCATION 
(AMBIG  WASHINGTON)))))  (FEATS  (NU  SG)  ) ) ) (AUX  (INS  PRESENT)  (VOICE 
ACTIVE))  (VP  (V  COST)  (OBJ  (NP  (I  90)))))  INTERP  (FOR:  THE  A0013  / 

(FIN  DQ:  PERSON  (FIRSTNAME  DILL))  : T ; (FOR:  TH  l AP^IW  / (FINDQ: 

LOCATION  (AMBIG  WASHINGTON))  : T ; (FOR:  THE  A0  015  / (FTNDQ:  DB/TRIP 
(DESTINATION  A0014)  .TIME  (AFTER  NOW))  (TRAVELER  A0P13))  : T 'PROGN 
(ADD:  A 00 15  VALUE  90)  (ADD:  A0015  UNITS  DOLLARS)))))  INTERP  (:  .HE 
A0015  DB/TRIP  (( TRAVFLER  (!  THE  A0013  PFRSON  ( ( FIPSTN  AMF  BILL)))) 

(TIME  ( APTE  P NOW))  (DESTINATION  (f  THE  A0014  LOCATION  ({AMBIG  WASHINGTON)))) 

**) 

NIL  NIL)  SEMLINKS  ((TRAVELER  (!  THE  A0013  P EF  SO  R ((FIRSTNAME  BILL)))) 
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(TIME  (AFTER  NOW))  (DESTINATION  (!  THE  A0014  LOCATION  ( (AMBIG  W AS  HI  NGTON1 ) ) ) 

**) 

OBJ  (NP  ($  90))  ACTION  ( PROGN  (ADD:  ***  VALUE  9e)  (ADD:  ***  UNITS 
DOLLARS))  SEHLINKS  ((TIME  (AFTER  NOW))  (DESTINATION  (!  THE  A0014 
LOCATION  ((AMBIG  WASHINGTON))))  (TRAVELER  (!  TH  l A0013  PERSON  (( 

FIRSTNAME  BILL)))))  VOICE  ACTIVE  TNS  PRESENT  HEAD  COST  V COST  SUBJ 
(NP  (DET  (ART  THE))  (N  TRIP)  (PP  (PREP  POP)  (NP  (NPR  (NAME  (FIRST 
BILL)))))  (PP  (PREP  TO)  (NP  (NPR  (LOCATION  (AMB IG  W ASHINGTON) ) ) ) ) 

(FEATS  (NU  SG)  ) ) TYPE  DCL  SEHLINKS  ((DESTINATION  (!  THE  A0014  LOCATION 
((AMBIG  WASHINGTON))))  (TRAVELER  (!  THE  A00  13  PIRSON  ((PIRSTNAHE  DILL))))) 
SEHLINKS  ((TRAVELER  (!  THE  A0013  PERSON  ( (FIRST  NAME  BILL)))))  SEHHEAD 
DB/TRIP  SEMVAR  A0015  SEHQUANT  THE) 

States  = END/  S/POP  S/DCL-IS-ADV  S/DCL-IS  S/DCL-SUBJ-S G START/  S/ 

Scopedacts  = (14) 

Parent  SC  = 48 

Creating  constituent  from  Sconfig  49: 


(FOR:  THE 
: T 


) 


A0013  / (FINDQ:  PERSON  (FIRSTNAME  Bill)) 

; (FOP:  THE  A0014  / (FINDQ:  LOCATION  (AHBIG  WASHINGTON)) 

: T ; (FOR:  THE  A0015  / (PINDC:  DB/TRIP 

(DESTINATION  A0014) 
(TIME  (AFTER  NOW)) 
(TRAVELER  A0013)) 

: T ; (PROGN  (ADD:  A0015  VALUE  90) 

(ADD:  A0015  UNITS  DOLLARS)))) 


S DCL 

SUBJ  NP  DET  ART  THE 
N TRIP 
PP  PREP  FOP 

NP  NPR  NAME  FIRST  BILL 
PP  PREP  TO 

NP  NPR  LOCATION  AHBIG 
FEATS  NU  SG 
AUX  TNS  PRESENT 
VOICE  ACTIVE 
VP  V COST 

OBJ  NP  t 90 


SCONFIG  49  covers  the  entire 
sentence  and  will  bt  recognized 
as  a complete  parse  on  the 
next  page. 

WASHINGTON 
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Pound  complete  parse: 

(POR:  THE  A0 0 1 3 / ( FINDQ:  PERSON  ( PTRSTN AM  ! BILL') 

: T ; (POR:  THE  A00 1 4 / (FINDQ:  LOCATION  (AMBIG  WASHINGTON)) 

: T ; (POR:  THE  A0  015  / 

( PIN  DQ : DB/TPIP  (DESTINATION  A0014, 
(TIM  I (AFTER  NOW)) 

(TRAVELER  A00  1 3)  ) 

: T ; 

(PROG N (ADD:  A00 1 S VALUE  90) 

(ADD:  A 00  1 5 UNITS  DOLLARS))))) 

S DCL 

SUBJ  NP  DET  ART  THE 
N TRIP 
PP  PREP  FOR 

NP  NPR  NAME  FIRST  BILL 
PP  PREP  TO 

NP  NPR  LOCATION  AMBIG  WASHINGTON 
PEATS  NU  SG 
AUX  TNS  PRESENT 
VOICE  ACTIVE 
VP  V COST 

OBJ  NP  t 90 


Doinq  left  proposals  for  Iconfiq  11 
found  0 arcs  reachinq  Iconfiq. 

Doinq  riqht  proposals  for  Iconfig  11 
found  0 arcs  reachinq  Iconfig. 

Created  new  island: 

Island  confiq  11  = 0 BILL  1 -S  2 TRIP  3 TO  4 WASHINGTON  5 COSTS  6 NINETY 
7 DOLLAR-S  B 

Left  spansflaq  = DONE  <: -indicates  a complete  parse  has  been  found. 

Right  spansflaq  - T 
Parent  IC  = 10 
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